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(54) High resolution crystal structure of the ribosome and design of protein synthesis inhibit rs 



(57) The invention provides methods for producing 
high resolution crystals of ribosomes and ribosomal 
subunits as well as crystals produced by such methods. 
The invention also provides high resolution structures 
of ribosomal subunits eitheralone or in combination with 
protein synthesis inhibitors. The invention provides 
methods for identifying ribosome-related ligands and 



methods for designing ligands with specific ribosome- 
binding properties as well as ligands that may act as pro- 
tein synthesis inhibitors. Thus, the methods and com- 
positions of the invention may be used to produce lig- 
ands that are designed to specifically kill or inhibit the 
growth of any target organism. 
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Description 

RELATED APPLICATIONS 

5 [0001] This application is a continuation-in-part of U.S. Application No. 09/635.708. filed August 9. 2000. and claims 
the benefit of (i) U.S. Provisional Application No. 60/223.977. filed August 9, 2000. (ii) U.S. Provisional Application No. 
[Atty. Docket No. RIB-002PR). entitled "The Kink-Turn: a New RNA Secondary Structure Motif." filed July 20. 2001 : 
and (iii) U.S. Provisional Application No. [Atty. Docket No. RIB-002PR2]. entitled "The Kink-Turn: a New RNA," filed 
August 1. 2001, the disclosures of each of the foregoing of which are incorporated by reference herein. 

10 

GOVERNMENT LICENSE RIGHTS 

[0002] Certain work described herein was supported, in part, by Federal Grant Nos. NIH-GM22778 and 
NIH-GM5421 6. awarded by the National Institutes of Health. The Government may have certain rights in the invention. 

15 

FIELD OF THE INVENTION 

[0003] The present invention relates generally to the field of protein biosynthesis and to modulators, for example, 
inhibitors, of protein biosynlhesis. More particularly the invention relates to methods and compositions for elucidating 
20 the three-dimensional structure of the large nbosomal subunit. either alone or in combination with a protein synthesis 
inhibitor, the three-dimensional structure of the large ribosomal subunit ; either alone or in combination with a protein 
synthesis inhibitor; the use of such structures in the design and testing of novel protein synthesis inhibitors: and novel 
protein synthesis inhibitors. 

25 BACKGROUND 

1. Ribosomes: Structure. Function, and Composition 

[0004] Ribosomes are ribonucleoproteins which are present in both prokaryotes and eukaryotes. They comprise 
30 about two-thirds RNA and one-third protein. Ribosomes are the cellular organelles responsible for protein synthesis. 
During gene expression, ribosomes translate the genetic information encoded in a messenger RNA into protein (Garrett 
et al. (2000) "The Ribosome: Structure. Function, Antibiotics and Cellular Interactions. "American Society for Micro- 
biology. Washington. D.C.). 

[0005] Ribosomes comprise two nonequivalent ribonucleoprotein subunits. The larger subunit (also known as the 

35 "large ribosomal subunit") is about twice the size of the smaller subunit (also known as the "small ribosomal subunit"). 
The small ribosomal subunit binds messenger RNA (mRNA) and mediates the interactions between mRNA and transfer 
RNA (tRNA) anticodons on which the fidelity of translation depends. The large ribosomal subunit catalyzes peptide 
bond formation -- the peptidyl-transferase reaction of protein synthesis -- and includes (at least) two different tRNA 
binding sites: the A-site which accommodates the incoming aminoacyl-tRNA, which is to contribute its amino acid to 

-to the growing peptide chain, and the P-site which accommodates the peptidyl-tRNA complex, i.e., the tRNA linked to all 
the amino acids that have so far been added to the peptide chain The large ribosomal subunit also includes one or 
more binding sites for G-protein factors that assist in the initiation, elongation, and termination phases of protein syn- 
thesis. The large and small ribosomal subunits behave independently during the initiation phase of protein synthesis; 
however, they assemble into complete ribosomes when elongation is about to begin. 

45 [0006] The molecular weight of the prokaryotic ribosome is about 2.6 x 1 0 6 daltons. In prokaryotes, the small ribos- 
omal subunit contains a 16S (Svedberg units) ribosomal RNA (rRNA) having a molecular weight of about 5.0 x 10 5 
daltons. The large ribosomal subunit contains a 23S rRNA having a molecular weight of about 1.0x 10 6 daltons and 
a 5S rRNA having a molecular weight of about 4.0 x 10 5 daltons. The prokaryotic small subunit contains about 20 
different proteins and its large subunit contains about 35 proteins. The large and small ribosomal subunits together 

so constitute a 70S ribosome in prokaryotes. 

[0007] Eukaryotic ribosomes generally are bigger than their prokaryotic counterparts. In eukaryotes, the large and 
small subunits together make an 80S ribosome. The small subunit of a eukaryotic ribosome includes a single 18S 
rRNA. while the targe subunit includes a 5S rRNA, a 5.8S rRNA. and a 28S rRNA. The 5.8S rRNA is structurally related 
to the 5' end of the prokaryotic 23S rRNA. and the 28S rRNA is structurally related to the remainder of the prokaryotic 

55 23S rRNA (Moore (1998) Annu. Rev. Biophys. 27: 35-58). Eukaryotic ribosomal proteins are qualitatively similar to the 
prokaryotic ribosomal proteins: however, the eukaryotic proteins are bigger and more numerous (Moore (1 998) supra). 



BNSDCCID <EP 1166769A2 I 



2 



EP1 188 769 A2 



II. Structural Conservation of the Large Ribosomal Subunit 

[0008] While the chemical composition of large ribosomal subunits vary from species to species, the sequences of 
their components provide unambiguous evidence that they are similar in three-dimensional structure, function in a 

5 similar manner and are related evolutionary. The evolutionary implications of rRNA sequence data available are 
reviewed in the articles of Woese and others in part II of Ribosomal RNA. Structure. Evolution. Processing and Function 
in Protein Biosynthesis . (Zimmermann and Dahlberg. eds.). (CRC Press. Boca Raton. FL. 1996). The anicle by Garret 
and Rodriguez-Fonseca in part IV of the same volume discusses the unusually high level of sequence conservation 
observed in the peptidyl transferase region of the large ribosomal subunit. The ribosomes of archeal species like Haioar- 

10 cuia marismortui resemble those obtained from eubacterial species like E. coli in size and complexity. However, the 
proteins in H. marismortui ribosomes are more closely related to the ribosomal proteins found in eukaryotes (Wool et 
ai (1995) Biochem. Cell Biol. 73: 933-947). 

III. Determination of the Structure of Ribosomes 

15 

[0009] Much of what is known about ribosome structure is derived trom physical and chemical methods that produce 
relatively low-resolution information. Electron microscopy (EM) has contributed to an understanding of ribosome struc- 
ture ever since the ribosome was discovered. In the 1970s, low resolution EM revealed the shape and quaternary 
organization of the ribosome. By the end of 1 980s : the positions of the surface epitopes of alt the proteins in the £. coli 

20 small subunit. as well as many in the large subunit. had been mapped using immunoelectron microscopy techniques 
(Oakes et ai (1986). Structure. Function and Genetics of Ribosomes . (Hardesty, B. and Kramer. G , eds.) Springer- 
Verlag. New York. NY. pp. 47-67; Stoeffler et ai ( 1 986). Structure. Function and Genetics of Ribosomes . (Hardesty. B. 
and Kramer. G.. eds.) Springer-Verlag. New York. NY pp.28-46). In the last few years, advances in single-particle 
cryo-EM and image reconstruction have led to three-dimensional reconstructions of the E. coli 70S ribosome and its 

25 complexes with tRNAs and elongation factors to resolutions of between 15 A and 25 A (Stark et ai (1995) Structure 
3 . 815-821: Stark era/. (1997) Nature389B: 403-406: Agrawal etal. (1996) Science 271 : 1 000-1 002: Stark etai (1997) 
Cell2Q: 19-28). Additionally, three-dimensional EM images of the ribosome have been produced at resolutions suffi- 
ciently high so that many of the proteins and nucleic acids that assist in protein synthesis can be visualized bound to 
the ribosome. An approximate model of the RNA structure in the large subunit has been constructed to fit a 7.5 A 

30 resolution electron microscopic map of the SOS subunit from E. coli and available biochemical data (Mueller et ai 
(2000) J. Moi Biol. 298 35-59). 

[0010] While the insights provided by EM have been useful, it has long been recognized that a full understanding of 
ribosome structure would derive only from X-ray crystallography. In 1979. Yonath and Wittman obtained the first po- 
tentially useful crystals of ribosomes and ribosomal subunits (Yonath et ai (1980) Biochem. Internal 1 : 428-435). By 
35 the mid 1 980s. scientists were preparing ribosome crystals for X-ray crystallography (Maskowski et ai (1 987) J. Moi 
Bioi 193: 818-822). The first crystals of 50S ribosomal subunit from H. marismortui were obtained in 1987. In 1991 . 
improvements were reported in the resolution of the diffraction data obtainable from the crystals of the SOS ribosomal 
subunit of H. marismortui {van Bohlen. K. (1991) J. Moi Biol 222: 11). 

[0011] In 1995. low resolution electron density maps for the large and small ribosomal subunits from halophilic and 
-to thermophilic sources were reported (Schlunzen et ai (1995) Biochem. Cell Biol. 73: 739-749). However, these low 
resolution electron density maps proved to be spurious (Ban et ai (1998) Cell 93: 1105-1115). 
[0012] The first electron density map of the ribosome that showed features recognizable as duplex RNA was a 9 A 
resolution X-ray crystallographic map of the large subunit from Haloarcula marismortui (Ban et at. (1 998) supra). Ex- 
tension of the phasing of that map to 5 A resolution made it possible to locate several proteins and nucleic acid se- 
■*5 quences. the structures of which had been determined independently (Ban et ai (1999) Nature 400: 841-847). 

[0013] At aboul the same time, using similar crystallographic strategies, a 7.8 A resolution map was generated of 
the entire Thetmus thermophilus ribosome showing the positions of IRNA molecules bound to ils A-. P-. and E- (protein 
exit site) sites (Cate et ai (1999) Science 285: 2095-2104). and a 5.5 A resolution map of the 30S subunit from T. 
thermophilus was obtained that allowed the fitting of solved protein structures and the interpretation of some of its RNA 
so features (Clemons. Jr. et ai (1999) Nature A0Q\ 833-840). Subsequently, a 4.5 A resolution map of the T. thermophilus 
30S subunit was published, which was based in part on phases calculated from a model corresponding to 28% of the 
subunit mass that had been obtained using a 6 A resolution experimental map (Tocilj ct at. (1999) Proc. NaV. Acad. 
Sci. USA 96 : 14252-14257). 

55 IV. Location of the Peptidyl Transferase Site in the Large Ribosomal Subunit 

[0014] It has been known for about 35 years that the peptidyl transferase activity responsible for the peptide bond 
formation that occurs during messenger RNA-directed protein synthesis is intrinsic to the large ribosomal subunit (Traut 
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etal. (1 964) J. Mol. Biol. 1 0: 63: Rychlik (1 966) Biochim. Biophys. Acta 114: 425; Monro (1 967) J. Mol. Biol. 26: 147-1 51 : 
Maden et al. (1968) 1 Mot. Biol 35: 333-345) and it has been understood for even longer that the ribosome contains 
proteins as well as RNA. In certain species of bacteria, tor example, the large ribosomal subunit contains about 35 
different proteins and two RNAs (Noller (1984) Ann. Rev. Biochem. 53: 119-162: Wittmann-Liebold et al (1990) The 
5 Ribosome: Structure. Function, and Evolution . (W.E. Hill et aL eds.) American Society for Microbiology. Washington. 
D.C. (1990). pp. 598-616). These findings posed three related questions. Which of the almost 40 macromolecular 
components of the large ribosomal subunit contribute to its peptidyl transferase site, where is that site located in the 
large subunit. and how does it work? 

[0015] By 1980, the list of components that might be part of the ribosome's peptidyl transferase had been reduced 

io to about half a dozen proteins and 23S rRNA (see Cooperman (1 980) Ribosomes: Structure. Function and Genetics . 
(G. Chambliss et aL eds.) University Park Press. Baltimore. MD (1980). 531-554), and following the discovery of 
catalytic RNAs (Guerrier-Takada et al. (1983) Ce//35: 849-857; Krugeref al (1982) Ce//31: 147-157). the hypothesis 
that 23S rRNA might be its sole constituent, which had been proposed years earlier, began to gain favor. In 1984. 
Noller and colleagues published affinity labeling results which showed that U261 9 and U2620 (in E. coli: U2584, U2585) 

'5 are adjacent to the CCA-end of P-site-bound tRNA (Barta et al (1 984) Proc. Nat Acad. Sci. USA 81 : 3607-361 1 : Vester 
et at. (1988) EMBO J. 7: 3577-3587). These nucleotides appear to be part of a highly conserved internal loop in the 
center of domain V of 23S rRNA. The hypothesis that this loop is intimately involved in the peptidyl transferase activity 
was supported by the observation that mutations in that loop render cells resistant to many inhibitors of peptidyl trans- 
ferase, and evidence implicating it in this activity has continued to mount (see. Noller (1991) Ann. Rev. Biochem, 60: 

20 191 -227: Garrett et al (1 996) Ribosomal RNA: Structure. Evolution. Processing and Function in Protein Biosynthesis . 
(R.A. Zimmerman and A.E. Dahlberg. eds.) CRC Press, Boca Raton. FL (1996). pp. 327-355). 
[001 6] Definitive proof that the central loop in domain V is the sole component of the ribosome involved in the peptidyl 
transferase activity has remained elusive, however. Studies have shown that it was possible to prepare particles that 
retained peptidyl transferase activity by increasingly vigorous deprotcinizations of large ribosomal subunits, however, 

25 it was not possible to produce active particles that were completely protein-free. Nevertheless, combined with earlier 
reconstitution results (Franceschi et al (1 990) J. Biol Chem. 265: 6676-6682). this work reduced the number of proteins 
that might be involved to just two: L2 and L3 (see. Green etal (1997) Annu. Rev. Biochem. 66: 679-716). More recently. 
Watanabe and coworkers reported success in eliciting peptidyl transferase activity from in vitro synthesized, protein- 
free 23S rRNA (Nitta et al (1998) RNA 4: 257-267), however, their observations appear not to have withstood further 

30 scrutiny. Thus the question still remained: is the ribosome a ribozyme or is it not? 

[0017] Over the years, the location of the peptidyl transferase site in the ribosome has been approached almost 
exclusively by electron microscopy. In the mid-1 980s evidence that there is a tunnel running through the large ribosomal 
subunit from the middle of its subunit interface side to its back (Milligan et al (1986) Nature 319: 693-695: Yonath et 
al (1987) Science 236: 813-816) began to accumulate, and there has been strong reason to believe that polypeptides 

35 pass through it as they are synthesized (Bernabeu etal (1982) Proc. Nat. Acad. Sci. USA 79: 3111-3115: Ryabova et 
al (1988) FEBS Letters 226: 255-260: Beckmann etal (1997) Science 278: 2123-2126). More recent cryo-EM inves- 
tigations (Frank et al (1995) Nature 376: 441-444: Frank et al. (1995) Biochem. Cell Biol 73: 757-765: Stark et al. 
(1995) supra) confirmed the existence of the tunnel and demonstrated that the CCA-ends of ribosome-bound tRNAs 
bound to the A- and P-sites are found in the subunit interlace end of the tunnel. Consequently, the peptidyl transferase 

to site must be located at that same position, which is at the bottom of a deep cleft in the center of the subunit interface 
surface of the large subunit. immediately below its central protuberance. 

[0018] The substrates of the reaction catalyzed at the peptidyl transferase site of the large subunit are an ammoacyl- 
tRNA (aa-tRNA) and a peptidyl-tRNA. The former binds in the ribosome's A-site and the latter in its P-site. The -amino 
group of the aa-tRNA attacks the carbon of the carbonyl acylating the 3' hydroxyl group of the peptidyl-tRNA. and a 

is tetrahedral intermediate is formed at the carbonyl carbon. The tetrahedral intermediate resolves to yield a peptide 
extended by one amino acid esterified to the A-site bound tRNA and a deacylated tRNA in the P-site. 
[0019] This reaction scheme is supported by Ihe observations of Yarus and colleagues who synthesized an analogue 
of the tetrahedral intermediate by joining an oligonucleotide having the sequence CCdA to puromycin via a phospho- 
ramide group (Welch et al (1995) Biochemistry 34 385-390). The sequence CCA. which is the 3' terminal sequence 

50 of all tRNAs. binds to the large subunit by itself, consistent with the biochemical data showing, that the interactions 
between tRNAs and the large subunit largely depend on their CCA sequences (Moazed et al (1991 ) Proc. Natl Acad. 
Sci. USA 88: 3725-3728). Puromycin is an aa-tRNA analogue that interacts with the ribosomal A-sitc. and the phos- 
phoramide group of the compound mimics the tetrahedral carbon intermediate. This transition state analogue. CCdA- 
phosphate-puromycin (CCdA-p-Puro). binds tightly to the ribosome. and inhibits its peptidyl transferase activity (Welch 

55 etal (1995) supra). 
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V. Structure Determination of Macromolecules Using X-ray Crystallography 

[0020] In order to better describe efforts undertaken to determine the structure of ribosomes. a general overview of 
X-ray crystallography is provided below. 

5 [0021] Each atom in a crystal scatters X-rays in all directions, but crystalline diffraction is observed only when a 
crystal is oriented relative to the X-ray beam so that the atomic scattering interferes constructively. The orientations 
that lead to diffraction may be computed if the wavelength of the X-rays used and the symmetry and dimensions of 
the crystal's unit cell are known (Blundell et at. (1976) Protein Crystallography (Molecular Biology Series). Academic 
Press. London). The result is that if a detector is placed behind a crystal that is being irradiated with monochromatic 

10 X-rays of an appropriate wavelength, the diffraction pattern recorded will consist of spots, each spot representing one 
of the orientations that gives rise to constructive interference. 

[0022] Each spot in such a pattern, however it is recorded, is characterized by (i) an intensity (often referred to as 
its blackness); (ii) a location, which encodes the information about diffraction orientation; and (iii) a phase. If all of those 
things are known about each spot in a crystal diffraction pattern, the distribution of electrons in the unit cell of the crystal 
is may be computed by Fourier transformation (Blundell et ai (1 976) supra), and from that distribution or electron density 
map. atomic positions can be determined. 

[0023] Unfortunately, the phase information essential for computing electron distributions cannot be measured di- 
rectly from diffraction patterns. One of the methods routinely used to determine the phases of macromolecules. such 
as proteins and nucleic acids, is called multiple isomorphous replacement (MIR) which involves the introduction of new 

20 x-ray scatterers into the unit cell of the crystal. Typically, these additions are heavy atoms, which make a significant 
contribution to the diffraction pattern. It is important that the additions be sufficiently low in number so that their positions 
can be located and that they leave the structure of the molecule or of the crystal cell unaltered, i.e., the crystals should 
be isomorphous. Isomorphous replacement usually is performed by diffusing different heavy-metal complexes into the 
channels of the preformed protein crystals. Macromolecules expose side chains (such as SH groups) in these solvent 

25 channels that are able to bind heavy metals. It is also possible to replace endogenous light metals in metalloproteins 
with heavier ones, e.g., zinc by mercury, or calcium by samarium. Alternatively, the isomorphous derivative can be 
obtained by covalently attaching a heavy metal to the macromolecule in solution and then subjecting it to crystallization 
conditions. 

[0024] Heavy metal atoms routinely used for isomorphous replacement include but are not limited to mercury, ura- 
30 nium. platinum, gold. lead, and selenium. Specific examples include mercury chloride, ethyl-mercury phosphate, and 
osmium pentamine. iridium pentamine. Since such heavy metals contain many more electrons than the light atoms 
(H. N. C. O. and S) of the protein, the heavy metals scatter x-rays more strongly All diffracted beams would therefore 
increase in intensity after heavy-metal substitution if all interference were positive. In fact, however, some interference 
is negative: consequently, following heavy-metal substitution, some spots increase in intensity, others decrease, and 
35 many show no detectable difference. 

[0025] Phase differences between diffracted spots can be determined from intensity changes following heavy-metal 
substitution. First, the intensity differences are used to deduce the positions of the heavy atoms in the crystal unit cell. 
Fourier summations of these intensity differences give maps, of the vectors between the heavy atoms, the so-called 
Patterson maps. From these vector maps, the atomic arrangement of the heavy atoms is deduced. From the positions 
40 of the heavy metals in the unit cell, the amplitudes and phases of their contribution to the diffracted beams of protein 
crystals containing heavy metals is calculated. 

[0026] This knowledge then is used to find the phase of the contribution from the protein in the absence of the heavy- 
metal atoms. As both the phase and amplitude of the heavy metals and the amplitude of the protein atone is known, 
as well as the amplitude ol the protein plus heavy metals (/.e., protein heavy-metal complex}, one phase and three 

•*5 amplitudes are known. From this, the interference of the X-rays scattered by the heavy metals and protein can be 
calculated to determine if the interference is constructive or destructive. The extent of positive or negative interference, 
with knowledge of the phase of the heavy metal, give an estimate of the phase of the protein. Because two different 
phase angles are determined and are equally good solutions, a second heavy-metal complex can be used which also 
gives two possible phase angles. Only one of these will have the same value as one of the two previous phase angles: 

50 jt therefore represents the correct phase angle. In practice, more than two different heavy-metal complexes are usually 
made in order to give a reasonably good estimate of the phase for all reflections. Each individual phase estimate 
contains experimental errors arising from errors in the measured amplitudes. Furthermore, for many reflections, the 
intensity differences are too small to measure after one particular isomorphous replacement, and others can be tried. 
[0027] The amplitudes and the phases of the diffraction data from the protein crystals are used to calculate an elec- 

55 tron-density map of the repeating unit of the crystal. This map then is interpreted to accommodate the residues of the 
molecule of interest. That interpretation is made more complex by several limitations in the data. First, the map itself 
contains errors, mainly due to errors in the phase angles. In addition, the quality of the map depends on the resolution 
of the diffraction data, which, in turn, depends on how well-ordered the crystals are. This directly influences the quality 
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of the map that can be produced. The resolution is measured in angstrom units (A): the smaller this number is. the 
higher the resolution and. therefore the greater the amount of detail that can be seen. 

[0028] Building the initial model is a trial-and-error process. First, one has to decide how a polypeptide chain or 
nucleic acid weaves its way through the electron-density map. The resulting chain trace constitutes a hypothesis by 
5 which one tries to match the density of side chains to the known sequence of the polypeptide or nucleic acid. When a 
reasonable chain trace has finally been obtained, an initial model is buitt that fits the atoms of the molecule into the 
electron density. Computer graphics are used both for chain tracing and for model building to present the data and 
manipulated the models. 

[0029] The initial model will contain some errors. Provided the crystals diffract to high enough resolution (e.g.. better 

10 than 3.5 A), most or substantially all of the errors can be removed by crystallographic refinement of the model using 
computer algorithms. In this process, the model is changed to minimize the difference between the experimentally 
observed diffraction amplitudes and those calculated for a hypothetical crystal containing the model (instead of the 
real molecule). This difference is expressed as an R factor (residual disagreement) which is 0.0 for exact agreement 
and about 0.59 for total disagreement. 

is [0030] In general, the R factor for a well-determined macromolecular structure preferably lies between 0. 1 5 and 0.35 
(such as less than about 0.24-0.28). The residual difference is a consequence of errors and imperfections in the data. 
These derive from various sources, including slight variations in the conformation of the protein molecules, as well as 
inaccurate corrections both for the presence of solvent and for differences in the orientation of the microcrystals from 
which Ihe crystal is built. This means thai the final model represents an average of molecules that are slightly different 

20 both in conformation and orientation. 

[0031] In refined structures at high resolution, there are usually no major errors in the orientation of individual resi- 
dues, and the estimated errors in atomic positions are usually around 0.1-0.2 A, provided the sequence of the protein 
or nucleic acid is known. Hydrogen bonds, both within the molecule of interest and to bound ligands. can be identified 
with a high degree of confidence. 

25 [0032] Typically, X-ray structures can be determined provided the resolution is better than 3.5 A. Electron-density 
maps are interpreted by fitting the known amino acid and/or nucleic acid sequences into regions of electron density. 

VI. The Need for Higher Resolution for the SOS Ribosomal Subunit 

30 [0033] Although the art provides crystals of the 50S ribosomal subunit. and 9A and 5 A resolution X-ray crystallo- 
graphic maps of the structure of the 50S ribosome. the prior art crystals and X-ray diffraction data are not sufficient to 
establish the three-dimensional structures of all 31 proteins and 3.043 nucleotides of the SOS ribosomal subunit. Thus, 
the prior art crystals and maps are inadequate for the structure-based design of active agents, such as herbicides, 
drugs, insecticides, and animal poisons. 

35 [0034] More detailed, higher resolution X-ray crystallographic maps are necessary in order to determine the location 
and three-dimensional structure of the proteins and nucleotides in ribosomes and ribosomal subunits ; particularly for 
the 50S ribosomal subunit. An accurate molecular structure of the 50S ribosomal subunit will not only enable further 
investigation and understanding of the mechanism of protein synthesis, but also the development of effective thera- 
peutic agents and drugs that modulate (e.g. induce or inhibit) protein synthesis. 

40 

SUMMARY OF THE INVENTION 

[0035] The present invention is based, in part, upon the determination of a high resolution atomic structure of a 
ribosomal subunit. more particularly, a large subunit of a ribosome. The high resolution structure has been determined 
•*5 for a large ribosomal subunit present in the organism Haloarcula marismortui. However, in view of the high level of 
sequence and structural homology between ribosomes of organisms in different kingdoms the structural information 
disclosed herein can be used to produce, using routine techniques, high resolution structural models of large ribosomal 
units for any organism of interest. 

[0036] Although there is significant homology between ribosomes of different organisms, for example, between ri- 
se bosomes of humans and certain human pathogens, there still are differences that can be exploited therapeutically. For 
example, many clinically and commercially significant protein synthesis inhibitors, for example, antibiotics such as 
streptomycin, tetracycline, chloramphenicol and erythromycin, selectively target bacterial ribosomes and disrupt bac- 
terial protein synthesis but at the same time do not target or otherwise significantly affect human ribosome function. 
As a result, over the years antibiotics have proven to be invaluable in the treatment of microbial infections in humans. 
55 However, there is still an ongoing need for new protein synthesis inhibitors, particularly because of the development 
of strains of pathogens that are resistant to known antibiotics. The information provided herein provides insights into 
the design of new protein synthesis inhibitors. 

[0037] The invention herein provides methods and compositions for resolving to a high resolution, the three-dimen- 
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sional structure of a ribosomal subunii of interest. In addition, the invention provides computer systems containing 
atomic co-ordinates that define at least a portion of the three-dimensional structure of a ribosome. more specifically, 
a large ribosomal subunit. In addition, the invention provides methods of using the atomic co-ordinates to identify new 
molecules that can selectively bind ribosomes. and that preferably act as selective inhibitors of protein synthesis, in 
5 addition, the invention provides new families of protein synthesis inhibitors. Each of these aspects of the invention are 
discussed in more detail below. 

[0038] In one aspect, the invention provides crystals, preferably untwinned crystals, of ribosomes or ribosomal sub- 
units that have an average thickness greater than about 1 5 urn. More specifically, the invention provides crystals having 
an average thickness from about 1 6 jam to about 65 urn. or from about 66 (am to about 1 05 urn. or from about 1 04 urn 
10 to about 1 55 urn . or from about 1 56 urn to about 205 urn. In particular, the invention provides crystals having an average 
thickness from about 1 00 urn to about 200 urn. 

[0039] In a preferred embodiment, the invention provides crystals that have an average thickness greater than about 
15 u.m and/or are untwinned wherein the crystals comprise the large ribosomal subunit. More particularly, the present 
invention provides such crystals wherein the large ribosomal subunit is a 50S or 60S ribosomal subunit. The crystals 

i5 may be obtained using the ribosomes or nbosomal subunits from prokaryotes or from eukaryotes. In a preferred em- 
bodiment, the invention provides crystals containing ribosomes or ribosomal subunits obtained from bacteria or ar- 
chaebacteria. more particularly, from the organism Haloarcula marismonui. However crystals may be obtained from 
ribosomes or ribosomal subunits from any organism, particularly from animals, more particularly from mammals, and 
even more particularly from humans. 

20 [0040] In another preferred embodiment, the crystals diffract X-rays to a resolution of at least about 4.5 A. more 
preferably to a resolution of at least about 3.0 A. and most preferably to a resolution of about 2.4 A for the determination 
of atomic co-ordinates of ribosomes or ribosomal subunits. In another preferred embodiment, the crystals of the present 
invention may also include a ligand. for example, a protein synthesis inhibitor, for example, an antibiotic, (such as a 
macrolidc antibiotic) complcxcd with, or bound to a ribosome or ribosomal subunit. 

25 [0041] In another aspect, the invention provides crystals of SOS ribosomal subunits whose atomic structure is char- 
acterized by the atomic co-ordinates deposited at the Protein Data Bank ID: 1FFK or 1JJ2. The invention further pro- 
vides phases computed from the co-ordinates of the deposited co-ordinates and the uses of such phase information. 
In a preferred embodiment, the invention provides crystals of 50S ribosomal subunits whose atomic structure is char- 
actered by the atomic co-ordinates deposited at the Protein Data Bank ID: 1 FFZ (large ribosomal subunit complexed 

30 with CCdA-p-Puro); or 1 FG0 (large ribosomal subunit complexed with a mini-helix analogue of aminoacyl-tRNA); as 
well as those ribosomal subunits whose atomic structure is characterized by the atomic coordinates listed in a file 
contained on Disk No. 3 of 3. specifically: large ribosomal subunit complexed with anisomycin (file name: anisomysin. 
pdb): large ribosomal subunit complexed with blasticidin (file name: blasticidin.pdb): large ribosomal subunit complexed 
with carbomycin (file name: carbomycin.pdb): large ribosomal subunit complexed with tylosin (file name: tylosin.pdb): 

35 large ribosomal subunit complexed with sparsomycin (file name: sparsomycin .pdb): large ribosomal subunit complexed 
with virginiamycin (file name: virginiamycin.pdb): or large ribosomal subunit complexed with spiramycin (file name: 
spiramycin. pdb). 

[0042] In another embodiment, the invention provides a method of obtaining an electron density map of a ribosomal 
subunit of interest that is only slightly different from the ribosomal subunit whose structure has already been determined, 

■to for example, by X-ray crystallography. The method comprises the steps of: (a) producing a crystal of the ribosomal 
subunit of interest, wherein the crystal is isomorphous: (b) obtaining diffraction amplitudes of the crystal produced in 
step (a): (c) combining the phases of the crystal of the ribosomal subunit whose structure is already known with the 
diffraction amplitudes obtained in step (b) to produce a combined data set: and (d) obtaining an electron density map 
of the selected ribosomal subunit based on the combined data set obtained in step (c). 

45 [0043] In another embodiment, the invention further provides a method of obtaining an electron density map of a 
ribosomal subunit of interest which is related to a ribosomal subunit whose structure is known. The method comprises 
the steps of. (a) producing a crystal of a ribosomal subunii of interest, wherein the crystal crystallines in a different unit 
cell with different symmetry than the crystal of the ribosomal subunit whose structure is known; (b) obtaining X-ray 
diffraction data of the crystal of interest: (c) inserting the atomic co-ordinates of the known ribosomal subunit into the 

so unit cell of the crystal of interest and modeling the co-ordinates such that they would be capable of producing theoretical 
X-ray diffraction data that resembles the X-ray diffraction data obtained in step (b): (d) obtaining phases of the crystal 
of interest from the modeled co-ordinates in step (c): and (c) obtaining an electron density map of the ribosomal subunit 
of interest from the X-ray diffraction data obtained in step (b) and the phases obtained in step (d). 
[0044] In addition, the invention provides a method of obtaining a model of a ribosomal subunit of interest, where 

55 the ribosomal subunit of interest diverges significantly from but is still homologous to the ribosomal subunit that was 
used to generate computed phases. The method comprises the steps of: (a) providing the atomic co-ordinates of the 
ribosomal subunit whose structure is known: and (b) using homology modeling to produce atomic co-ordinates of the 
ribosomal subunit of interest. 
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[0045] In another aspect, the invention provides a method of growing a crystal of a ribosome or a ribosomal subunit. 
as well as crystals resulting from such a method. The method comprises the steps of: (a) isolating a ribosome or a 
ribosomal subunit: (b) precipitating the ribosome or ribosomal subunit: (c) back-extracting the precipitated ribosome 
or ribosomal subunit to obtain a solution : (d) seeding the back-extracted solution: (e) growing a crystal of the ribosome 

5 or ribosomal subunit from the seeded solution by vapor diffusion at room temperature: and (f) harvesting the crystal. 
Optionally, the method may further comprise one or more of the following steps: (g) stabilizing the crystal by gradual 
transfer into a solution containing high salt concentration, for example : from about 1 .2 M salt to about 1.7 M salt: (h) 
maintaining the crystal under such a high salt concentration, and (i) flash freezing the crystal. 
[0046] In another aspect, the invention provides a method of obtaining X-ray diffraction data for a crystal of a ribosome 

10 or a ribosomal subunit. The method comprises the steps of: (a) obtaining a crystal of a ribosome or a ribosomal subunit. 
wherein the crystal has one or more of the following characteristics (1) an average thickness of greater than 15 um 
and (2) untwinned; and (b) using X-ray crystallography to obtain X-ray diffraction data for the crystal of the ribosome 
or ribosomal subunit. The present invention also discloses a method of obtaining an electron density map of a ribosome 
or a ribosomal subunit comprising using the X-ray diffraction data described herein to obtain an electron density map 

'5 of the ribosome or ribosomal subunit. 

[0047] In another aspect, the invention provides a method of obtaining X-ray diffraction data for a complex of a 
ribosome and a ligand : for example, a protein synthesis inhibitor or a complex of a ribosomal subunit and a ligand. 
The method comprises the steps of (a) obtaining a crystal of a ribosome or a ribosomal subunit, wherein the crystal 
has one or more of the following characteristics: (1) an average thickness of greater than 15 urn. and (2) untwinned; 

20 (b) diffusing a ligand into the crystal and permitting the ligand to attach to the crystal so as to form a complex; and (c) 
using X-ray crystallography to obtain X-ray diffraction data for the complex. In an alternative aspect, the invention 
provides a method of obtaining X-ray diffraction data for a complex of a ribosome and a ligand, for example, a protein 
synthesis inhibitor or for a ribosomal subunit and a ligand. The method comprises the steps of: (a) obtaining a co- 
crystal for a complex of a ribosome and a ligand or for a complex of a ribosomal subunit and a ligand, wherein the co- 

25 crystal has one or more of the following characteristics: (1) an average thickness of greater than 15 urn, and (2) un- 
twinned; and (b) using X-ray crystallography to obtain X-ray diffraction data for the complex. In either method, the X- 
ray diffraction data can be used to produce an electron density map for a complex of a ribosome and a ligand or for a 
complex of a ribosomal subunit and a ligand. 

[0048] In a preferred embodiment, the invention provides a method of locating the attachment of such a ligand to a 
30 ribosome or the attachment of the ligand to a ribosomal subunit. The method comprises the steps of: (a) obtaining X- 
ray diffraction data for a ribosome or for a ribosomal subunit: (b) obtaining X-ray diffraction data for a complex of a 
ribosome and a ligand or for a complex of a ribosomal subunit and a ligand: (c) subtracting the X-ray diffraction data 
obtained in step (a) from the X-ray diffraction data obtained in step (b) to obtain the difference in the X-ray diffraction 
data: (d) obtaining phases that correspond to X-ray diffraction data obtained in step (a) using one or more of the 
35 techniques selected from the group consisting of Ml R. Ml RAS , SAD and computation from an existing atomic structure; 
(e) utilizing the phases obtained in step (d) and the difference in the X-ray diffraction data obtained in step (c) to compute 
a difference Fourier image of the ligand: and (f) locating the attachment of the ligand to a ribosome or the attachment 
of the ligand to a ribosomal subunit based on the computations obtained in step (e). 

[0049] In another embodiment, the invention provides an alternative method of obtaining a map of such a ligand 
■to attached to a ribosome or of a ligand attached to a ribosomal subunit. The method comprises the steps of: (a) obtaining 
X-ray diffraction data for a ribosome or for a ribosomal subunit: (b) obtaining X-ray diffraction data for a complex of a 
ribosome and a ligand or a complex of a ribosomal subunit and a ligand: (c) obtaining phases that correspond to X- 
ray diffraction data obtained in step (a) using one or more of the techniques selected from the group consisting of MIR : 
MIRAS. SAD and computation from an existing atomic structure: and (d) utilizing the phases obtained in step (c) and 
the X-ray diffraction data obtained in step (b) to compute a map of the ligand and the ribosome or of the ligand and 
the ribosomal subunit. 

[0050] In another aspect, the invention provides a computer system comprising: (a) a memory having stored therein 
data indicative of atomic co-ordinates derived from an electron density map having a resolution of at least about 4.5 
A. more preferably of at least about 3.0 A. and most preferably of about 2.4 A and defining a ribofunctional locus of a 

so large subunit of a ribosome: and (b) a processor in electrical communication with the memory, the processor comprising 
a program for generating a three-dimensional model representative of the ribofunctional locus. In a preferred embod- 
iment, the computer system further comprises a device, for example, a computer monitor, or terminal for providing a 
visual representation of the molecular model. In another preferred embodiment, the processor further comprises one 
or more programs to facilitate rational drug design. 

55 [0051] In a preferred embodiment, the computer system further comprises at least a portion of the atomic co-ordinates 
deposited at the Protein Data Bank under accession number PDB ID: 1 FFK. 1 FF2, 1 FG0. or 1JJ2. In another preferred 
embodiment, the atomic co-ordinates further define at least a portion of a protein synthesis inhibitor for example, an 
antibiotic, more specifically an antibiotic selected from the group consisting of anisomycin, blasticidin, carbomycin, 
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sparsomycin. spiramycin, tylosin and virginiamycin. complexed with a ribofunctional locus, for example, at least a 
portion of the atomic co-ordinates recorded on compact disk Disk No. 3 of 3. included herein. 
[0052] In a preferred embodiment, the ribofunctional locus comprises at least a portion of an active site in the nbos- 
omal subunit. for example, at least a portion of one or more of: a peptidyl transferase site (a portion of which may be 

5 defined by a plurality of residues set forth in Table 5) an A-site (a portion of which may be defined by a plurality of 
residues set forth in Table 6): a P-site (a portion of which may be defined by a plurality of residues set forth in Tabie 
7): a polypeptide exit tunnel (a portion of which may be defined by a plurality of residues set forth in Table 8. Table 9 
or Table 1 0): or an antibiotic binding domain (a portion of which may be defined by a plurality of residues set forth in 
Table 11. Table 12. Table 13. Table 14. Table 15. Table 16 or Table 17). Plurality of residues shall be considered to 

io include at least 3 residues, preferably at least 5 residues, and more preferably at least 1 0 residues. The ribofunctional 
locus may be defined by atoms of ribosomal RNA. one or more ribosomal proteins, or a combination of ribosomal RNA 
and one or more ribosomal proteins. 

[0053] In another preferred embodiment, the atomic co-ordinates are produced by molecular modeling Using the 
atomic co-ordinates provided herein, the skilled artisan may generate models of any ribosome of interest using con- 

15 ventional techniques, for example, conventional homology modeling, and or molecular replacement techniques, in 
another embodiment, the atomic co-ordinates are produced by homology modeling using at least a portion of the atomic 
co-ordinates deposited at the Protein Data Bank under accession number PDB ID: 1 FFK : 1 FFZ. 1 FG0. or 1JJ2, or the 
atomic co-ordinates included in compact disk Disk No. 3 of 3. In another embodiment the atomic co-ordinates are 
produced by molecular replacement using at least a portion of the atomic co-ordinates deposited al the Protein Data 

20 Bank under accession number PDB ID: 1FFK : 1 FFZ. 1FG0. or 1JJ2. or the atomic co-ordinates included m compact 
disk Disk No. 3 of 3. 

[0054] In a preferred embodiment, the atomic co-ordinates define residues that are conserved between ribosomes 
or ribosomal subunits of pathogens, for example, prokaryotic organisms. and ; optionally but more preferably, are also 
absent from ribosomes or ribosomal subunits of a host organism, for example, a human. In another preferred cmbod- 

25 iment. the atomic co-ordinates may define residues that are conserved between ribosomes or ribosomal subunits of 
prokaryotic organisms, for example, bacteria, and. optionally but more preferably, are also absent from ribosomal 
subunits of eukaryotes. for example, a mammal, more preferably, a human. This information can be used, for example, 
via the use of one or more molecular models, to identify targets for rational drug design that may be exploited to develop 
new molecules, for example, protein synthesis inhibitors, that disrupt protein synthesis in a pathogen, for example, a 

30 bacteria, but do not disrupt or otherwise substantially affect protein synthesis in a host organism, for example, a human. 
[0055] In another aspect, the invention provides a variety of methods for designing, testing and refining new mole- 
cules via rational drug design. For example, the invention provides a method that comprises the steps of: (a) providing 
a model, for example, a molecular model, having a ribofunctional locus of a large subunit of a ribosome, wherein the 
model is defined by the spatial arrangement of atoms derived from an electron density map having a resolution of at 

35 least about 4.5 A. more preferably to at least about 3.0 A. and most preferably to about 2.4 A: and (b) using the model 
to identify a candidate molecule having a surface complementary to the ribofunctional locus. Preferably, the candidate 
molecule stereochemical^ interfits and more preferably binds with the ribofunctional locus of the large subunit of the 
ribosome. 

[0056] In a preferred embodiment, the method comprises one or more additional steps of: producing the candidate 
•to molecule identified in such a method: determining whether the candidate molecule, when produced, modulates (for 
example, induces or reduces) ribosomal activity: identifying a modified molecule: producing the modified molecule; 
determining whether the modified molecule, when produced, modulates ribosomal activity; and producing the modified 
molecule for use either alone or in combination with a pharmaceutical^ acceptable carrier. The candidate molecule 
and/or tne modified molecule may be an antibiotic or antibiotic analogue, for example, a macrolide antibiotic or a 
•*s macrolide analogue. 

[0057] In a preferred embodiment, the ribofunctional locus used in such a method comprises at least a portion of an 
active site in the ribosomal subunit. In another preferred embodiment, the ribofunctional locus is defined by at least a 
portion of one or more of: a peptidyl transferase site (a portion of which may be defined by a plurality of residues set 
forth in Table 5): an A-site (a portion of which may be defined by a plurality of residues set forth in Table 6): a P-site (a 

50 portion of which may be defined by a plurality of residues set forth in Table 7): a polypeptide exit tunnel (a portion of 
which may oe defined by a plurality of residues set forth in Table B. Table 9 or Table 1 0): or an antibiotic binding domain 
(a portion of which may be defined by a plurality of residues set forth in Tabic 11 , Table 12. Tabic 13. Tabic 14. Tabic 
1 5, Table 16 or Table 1 7). The ribofunctional locus may be defined by atoms of ribosomal RNA. one or more ribosomal 
proteins, or a combination of ribosomal RNA and one or more ribosomal proteins. 

55 [0058] In another preferred embodiment, the atomic co-ordinates are used to produce a molecular model in an elec- 
tronic form The atomic co-ordinates preferably are produced by molecular modeling In another embodiment, the 
atomic co-ordinates are produced by homology modeling using at least a portion of (he atomic co-ordinates deposited 
at the Protein Data Bank under accession number PDB ID: 1 FFK, 1FFZ. 1FG0. or 1JJ2. or the atomic co-ordinates 
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included in compact disk Disk No. 3 of 3. In another embodiment, the atomic co-ordinates are produced by molecular 
replacement using at least a portion of the atomic co-ordinates deposited at the Protein Data Bank under accession 
number PDB ID: 1FFK. 1FFZ 1FG0. or 1JJ2. or the atomic co-ordinates included in compact disk Disk No. 3 of 3. 
[0059] In a preferred embodiment, the atomic co-ordinates may define residues that are conserved among ribosomes 

5 or ribosomal subunits of pathogens, for example, prokaryotic organisms, and. optionally but more preferably, are also 
absent in ribosomes or ribosomal subunits of a host organism, for example, a human. In another preferred embodiment, 
the atomic co-ordinates may define residues that are conserved between ribosomes or ribosomal subunits of prokary- 
otic organisms, for example, bacteria, and. optionally but more preferably, are also absent from ribosomes or ribosomal 
subunits of eukaryotes. for example, a mammal, more preferably a human. This information can be used, for example. 

10 via the use of one or more molecular modeis. to identify targets for rational drug design that may be exploited to develop 
new molecules, for example, protein synthesis inhibitors, that disrupt protein synthesis in a pathogen, for example, a 
bacteria but do not disrupt or otherwise substantially affect protein synthesis in a host organism, for example ; a human. 
[0060] In a preferred embodiment, the invention provides a method of obtaining a modified agent. The method com- 
prises the steps of: (a) obtaining a crystal of a ribosome or of a ribosomal subunit; (b) obtaining the atomic co-ordinates 

'5 of the crystal: (c) using the atomic co-ordinates and one or more molecular modeling techniques, for example, graphic 
molecular modeling and computational chemistry, to determine how to modify the interaction of an agent with a ribosome 
or ribosomal subunit: and (d) modifying the agent based on the determinations obtained in step (c) to produce a modified 
agent. Alternatively, the method further comprises contacting the modified agent with a ribosome or ribosomal subunit 
and delecting the interaction of the agent to the ribosome or ribosomal subunit. The present invention also provides 

20 such a modified agent (preferably a therapeutic agent), wherein the modified agent binds differently to a ribosome or 
ribosomal subunit than does the agent from which the modified agent was derived. 

[0061] In another aspect, the invention provides new protein synthesis inhibitors that disrupt the function of a target 
ribosome. These inhibitors can be readily designed and tested as disclosed herein. 

[0062] One type of protein synthesis inhibitor of the invention comprises: a first binding domain having a surface, for 
25 example, a solvent accessible surface, that mimics or duplicates a surface of a known first molecule, for example, a 
first antibiotic, that binds with a first contact site, for example, a first ribofunctional locus, in or on a large ribosomal 
subunit: and a second binding domain having a surface, for example, a solvent accessible surface, that mimics or 
duplicates a surface of a known second molecule, for example, a second antibiotic, that binds with a second contact 
site, for example, a second ribofunctional locus, in or on the ribosomal subunit. The first domain is attached to the 
30 second domain so as to permit both the first domain and the second domain to bind simultaneously with their respective 
contact sites within or on the ribosomal subunit so as to disrupt protein synthesis in a ribosomal subunit. 
[0063] Another type of protein synthesis inhibitor is a synthetic, engineered molecule that comprises: a binding do- 
main having a surface, for example, a solvent accessible surface, that mimics or duplicates a solvent accessible surface 
of a known molecule, for example, a first known antibiotic, which binds with a contact site, for example, a ribofunctional 
35 locus in or on a ribosomal subunit; and an effector domain attached to the binding domain which, upon binding of the 
binding domain with the contact site, occupies a space within or adjacent the ribosomal subunit thereby to disrupt 
protein synthesis in the ribosomal subunit. 

[0064] The foregoing aspects and embodiments of the invention may be more fully understood by reference to the 
following figures, detailed description and claims. Further advantages are evident from the drawings (provided in both 
*o grayscale and color). 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0065] The patent or application file contains at least one drawing executed in color. Copies of this patent or patent 
•*5 application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary 
fee. 

[0066] Color renditions similar to many of the following figures can be found, for example, in Ban el al. (2000) Science 
289: 905-920; or Nissen et a/. (2000) Science 289: 920-929. 

[0067] The objects and features of the invention may be more fully understood by reference to the drawings described 
so below: 

[0068] Figures 1 (A)-(E) show the electron density from a 2.4 A resolution electron density map. Specifically, Figure 
1(A) shows a stereo view of a junction between 23S rRNA domains II, III, and IV. Figure 1(B) shows the. extended 
region of protein L2 interacting with surrounding RNA. Figure 1(C) shows in detail the L2 region with a bound Mg 2+ 
ion. Figure 1(D) shows in detail L2 with amino acid side chains. Figure 1(E) shows helices 94-97 from domain 6. 
55 [0069] Figure 2 shows the H. marismortui large ribosomal subunit in the crown view. The subunit is shown in the 
crown view, with its L7/L12 stalk to the right, its L1 stalk to the left, and its central protuberance (CP) up. In this view, 
the surface of the subunit that interacts with the small ribosomal subunit faces the reader. RNA is shown in gray in a 
space-filling rendering. The backbones of the proteins visible are rendered in gold. A transition state analogue bound 
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to the peptidyl transferase site of the subunit is indicated in green. The particle is approximately 250 A across. 
[0070] Figures 3(A)-(B) show the secondary structure of the 23S rRN A from H. marismortui. The secondary structure 
of this 23S rRNA is shown in a standardized format. Figure 3(A) shows the 5' half of the large subunit rRNA. Figure 
3(B) show the 3' half of the large subunit rRNA. This diagram shows all the base pairings seen in the crystal structure 
5 of the large subunit that are stabilized by at least two hydrogen bonds. Pairings shown in red were predicted and are 
observed. Those shown in green were predicted, but are not observed. Interactions shown in blue are observed, but 
were not predicted. Bases shown in black do not appear to be involved in pairing interactions. Sequences that cannot 
be visualized in the 2.4 A resolution electron density map are depicted in gray with the secondary structures predicted 
for them. 

10 [0071] Figures 4(A)-(L) show the tertiary structures of the RNA domains in the h. marismortui large ribosomal sub- 
unit, its RNA as a whole, and schematics of its RNAs. Specifically Figures 4(A) and 4(B) show the RNA structure of 
the entire subunit. Domains are color coded as shown in the schematic of Figure 5(C). Figure 4(A) shows the particle 
in the crown view. Figure 4(B) shows the image in Figure 4(A) rotated 180° about an axis running vertically in the 
plane of the image. Figures 4(C) and 4(D) show a schematic diagram of 23S rRNA and the secondary structure of 5S 

is rRNA. Figure 4(C) shows a schematic diagram of 23S rRNA secondary structure of Figure 3 with helices numbered 
according to Letters et at. ((1987) J. Mot. Biol. 195: 43-61), and the domains of the molecule are indicated by color 
shading Figure 4(D) shows the secondary structure of 5S rRNA from H. marismortui. Thick lines joining bases rep- 
resent Watson-Crick pairing. Bases joined by a lower case "c" indicate non-Watson-Crick pairing. Bases joined by thin 
lines inleracl via a single hydrogen bond. Bases shown in black are not paired. Bases shown in red are phylogenelically 

20 predicted pairing that have now been confirmed (Symanski et at. (1998) Nucl. Acids Res. 26: 156-159). Pairs shown 
in blue are observed, but were not predicted, and pairs shown in green were predicted but are not observed. Figures 
4(E) through 4(L) show stereo views of the RNA domains in the 23S rRNA and of 5S rRNA. Each domain is color- 
coded from its 5' end to its 3' end to facilitate the viewer following its trajectory in three-dimensions. The surfaces where 
the most important intcr-domain interactions occur arc shown in mono to the right of the stereo views. Figure 4(E) 

25 shows domain I: Figure 4(F) shows domain II: Figure 4(G) shows domain III: Figure 4(H) shows domain IV: Figure 
4(l) shows domain V. crown view: Figure 4(J) shows domain V. back view: Figure 4(K) shows domain VI: and Figure 
4(L) shows 5S rRNA. 

[0072] Figures 5(A)-(C) show conservations and expansions in the 23S rRNA of H. marismortui. The generality of 
the RNA in these images is gray. Sequences that are found to be >95% conserved across the three phylogenetic 
30 kingdoms are shown in red. Sequences where expansion in the basic 23S structure is permitted are shown in green 
(Gutell et at. (2000) supra). Specifically. Figure 5(A) shows the particle rotated with respect to the crown view so that 
its active site cleft can be seen. Figure 5(B) shows the crown view. Figure 5(C) shows the back view of the particle. 
i.e., the crown view rotated 180° about its vertical axis. 

[0073] Figures 6(A)-(I) show structures of some large subunit ribosomal proteins that have non-globular extensions. 

35 Only the backbones of the proteins are shown. The globular domains of these proteins are shown in green, and their 
non-globular extensions are depicted in red. The positions of the zinc ions in L44e and L37e are also indicated. Figure 
6(A) shows L2: Figure 6(B) shows L3: Figure 6(C) shows L39: Figure 6(D) shows L4; Figure 6(E) shows L1 5; Figure 
6(F) shows L21 e: Figure 6(G) shows L44e: Figure 6(H) shows L37e. Figure 6(1) shows L1 9: 
[0074] Figures 7(A)-(C) show proteins that appear on the surface of the large ribosomal subunit. The RNA of the 

•to subunit is shown in gray as in Figure 2 and protein backbones are shown in gold. Specifically Figure 7(A) shows the 
subunit in the crown view of the subunit. Figure 7(B) shows the back side of the subunit in the crown view orientation. 
Figure 7(C) shows the bottom view: the end of the peptide tunnel appears in the center of this image The proteins 
visible in each image are identified in the small images at the lower left corner of the Figure. 

[0075] Figures 8(A)-(F) show the protein distribution and protein-RNA interactions in the large ribosomal subunit. 

is Specifically. Figure 8(A) shows the structures of proteins m the neighborhood of the end of the peptide tunnel and how 
they relate to the RNA sequences with which they interact. Protein L22 extends a long hairpin extension inside the 
23S rRNA. L24 has a similar extension but the entire protein is on the surface of the particle. L39 is the only prolem 
in the subunit that lacks tertiary structure, while L37e has both NH 2 and C0OH terminal extensions. L19 is unique in 
having two globular domains on the surface of the subunit connected by an extended sequence that weaves through 

so the RNA. The end of L39 (green) actually enters the tunnel, while L37e (red) is entirely surrounded by RNA. Figure 8 
(B) shows the non-globular extensions of L2 and L3 reaching through the mass of 23S rRNA towards the peptidyl 
transferase site, which is marked by a CCdA-p-puromycin molecule. Figure 8(C) shows L22 interacting with portions 
of all six of the domains of 23S rRNA. Figure 8(D) shows a schematic of 23S rRNA showing the locations of the 
sequences that make at least van der Waals contact with protein (red). Figure 8(E) shows a stereo view of the proteins 

55 of the large ribosomal subunit with all the RNA stripped away. Proteins are color red as an aid to visualization only. 
Figure 8(F) shows a cross section of the subunit in the area of the tunnel exit. Protein L22 is shown as ribbons in red ; 
and the hairpin loop where mutations confer erythromycin resistance is shown in orange Atoms on the surface are 
shown in gray, protein atoms are shown in green, and atoms at the slice interface are shown in blue. 
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[0076] Figures 9(A)-(C) show chemical structures of ribosome peptidyl transferase substrates and analogues Spe- 
cifically. Figure 9(A) shows the tetrahedral carbon intermediate produced during peptide bond formation : the tetrahedral 
carbon is indicated by an arrow. Figure 9(B) shows the transition state analogue formed by coupling the 3' OH of CCdA 
to the amino group of the O-methyl tyrosine residue of puromycin via a phosphate group. CCdA-p-Puro (Welch et at. 
s (1 995) supra). Figure 9(C) shows an amino-N-acylated mini helix constructed to target the A-stte. The oligonucleotide 
sequence 5 l phosphate CCGGCGGGCUGGUUCAAACCGGCCCGCCGGACC 3' (SEQ ID NO: 1) puromycin should 
form 12 base pairs. The construct was based on a mini helix which is a suitable substrate for amino-acylation by Tyr- 
tRNA synthetase The 3' OH of its terminal C is coupled to the 5' OH of the N6-dimethyl A moiety of puromycin by a 
phosphodiester bond. 

10 [0077J Figures 10(A)-(C) show experimentally phased electron density maps of the substrate analogue complexes 
at 3.2 A resolution, with models superimposed (oxygen, red: phosphorus, purple: nitrogen, blue: and carbon, green 
for rRNA and yellow for substrate). Specifically. Figure 10(A) shows an F 0 (complex)- F 0 (parent) difference electron 
density map with a skeletal model of CCdA-p-Puro superimposed. Figure 10(B) shows a 2F 0 (complex)- F 0 (parent) 
electron density map of the CCdA-p-Puro in the active site region with the structures of the ribosome and inhibitor 

J5 superimposed showing the proximity ol the N3 of A2486 (2451 ) to the phosphate, non-bridging oxygen in this complex. 
Figure 1 0(C) shows an F 0 (complex)- F 0 (parent) differences electron density map of the tRNA acceptor stem analogue 
with a skeletal model of CCpuro superimposed. There is density only for the ribose and phosphate of C74 and none 
for the rest of the RNA hairpin. 

[0078] Figures 1 1(A) and (B) show a combined model of the CCA portion of the mini helix bound to the A-sile and 
20 CCdA-p-Puro bound to the A- and P-sites. color coded as in Figure 2. Specifically, Figure 11(A) shows the base- 
pairing interactions between the P-site C74 and C75 and the P loop of 23S rRNA on the left and the A-site C75 with 
the A loop of 23S rRNA on the right. The catalytic A24B6 is near the phosphate oxygen (P) that is the analogue of the 
tetrahedral intermediate oxyanion. Figure 11(B) shows A2637 (in all blue) lying between the two CCA's and A2486 
(green) whoso N3 approaches a non-bridging phosphate oxygen. The N1 atoms of the A76 bases from the A- and P- 
25 site tRNAs are making nearly identical interactions with a ribose 2' OH in both the A- and P-loops. respectively, and 
an approximate 2-fold axis relates these residues. 

[0079] Figure 12 shows a space filling model of the 23S and 5S rRNA, the proteins and the CCdA-p-Puro inhibitor 
viewed down the active site cleft in a rotated "crown view." The bases are white and the sugar phosphate backbones 
are yellow The inhibitor is shown in red and the numbered proteins are shown in blue. The L1 and L11 proteins posi- 

30 tioned at lower resolution are in blue backbone. The central protuberance is labeled CP. 

[0080] Figure 13(A) shows a stereo view diagram of the three-dimensional distribution of the residues comprising 
the loops A and P and the peptidyl transferase loop. Figure 13(B) shows a stereo view of the central loop in domain 
V from the direction of the tunnel. The residues are color coded based on mutations which confer antibiotic resistance. 
Figure 13(C) shows domain V active site with its central loop shown as the secondary structure. 

35 [0081] Figures 14(A) and (B) show the closest approach of polypeptides to the peptidyl transferase active site 
marked by a ball and stick representation of the Varus inhibitor. CCdA-p-Puro. Specifically. Figure 14(A) shows a coil 
representation of domain V RNA backbone in red and bases in gray and a ribbon backbone representation of all thirteen 
proteins that interact with it. Figure 1 4(B) shows a close-up view of the active site with the RNA removed. The phosphate 
of the Yarus analogue and the proteins whose extensions are closest to the inhibitor are shown in ribbon with their 

•to closest side-chains in all atom representation. The distances in A between the closest protein atoms and the phos- 
phorous analogue of the tetrahedral carbon (pink) are shown, as is a modeled peptide (pink). 
[0082] Figure 15 shows conserved nucleotides in the peptidyl transferase region that binds CCdA-p-Puro A space 
■ filling representation of the active site region with the Varus inhibitor viewed down the active site cleft. All atoms be- 
longing to 23S rRNA nucleotides that are 95% conserved in all three kingdoms (Gutell et ai (2000) supra) are colored 

■« red and all other nucleotides are white the inhibitor is colored blue. 

[0083] Figures 16(A)-(C) show the catalytic apparatus of the peptidyl transferase active site. Specifically, Figure16 
(A) shows stereo view of a portion ol the experimental 2.4 A resolution electron density map (Ban et ai (2000) Science 
289: 905-920) of the large subunit in the region of the catalytic site in stereo. The structure the RNA involved in inter- 
actions with A2486 is superimposed. Residues G2102 (2061) and G2482 (2447) are hydrogen bonded to the N6 of 

so A2486 (2451) and G2482 which interacts with a neighboring phosphate group Figure 16(B) shows a skeletal repre- 
sentation with dashed hydrogen-bonds showing G2482. G2102. A2486 and the buried phosphate that is proposed to 
result in a charge relay through G2482 to the N3 of A2486. Figure 16(C) shows the normal and rarer iminc tautomeric 
forms of G2482 and A2486 that are proposed to be stabilized by the buried phosphate of residue 2485. 
[0084] Figures 17(A)-(C) show the proposed mechanism of peptide synthesis catalyzed by the ribosome. Specifi- 

55 cally. Figure17(A) shows the N3 of A2486 abstracting a proton from the NH 2 group as the latter attacks the carbonyl 
carbon of the peptidyl-tRNA. Figure 17(B) shows a protonated N3 stabilizing the tetrahedral carbon intermediate by 
hydrogen bonding to the oxyanion. Figure 17(C) shows the proton transferred from the N3 to the peptidyl tRNA 3' OH 
as the newly formed peptide deacylates. 
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[0085] Figures 18(A) and (B) show space filling representations of the SOS ribosomal subunit with the 3 tRNA mol- 
ecules, in the same relative orientation that they are found in the 70S ribosome structure by Noller and colleagues 
docked onto the CCA's bound in the A-Site and P-Site. Specifically Figure 18(A). shown on the left-hand side, shows 
the whole subunit in rotated crown view with the rRNA in yellow, proteins in pink and tRNAs in orange. Figure 18(B), 

5 shown on the right-hand side, shows a close-up view showing the numbered proteins are in pink and the rRNA in blue. 
A backbone ribbon representation of the A-. P-. and E-sites are shown in yellow, red and white, respectively. 
[0086] Figures 19(A>-(F) show the polypeptide exit tunnel. Specifically. Figure 19(A) shows the subunit cut in half, 
roughly bisecting its central protuberance and its peptide tunnel along the entire length. The two halves have been 
opened like the pages of a book. All ribosome atoms are shown in CPK representation, with all RNA atoms that do not 

io contact solvent shown in white and all protein atoms that do not contact solvent shown in green. Surface atoms of both 
protein and RNA are color-coded with carbon in yellow : oxygen in red. and nitrogen in blue. A possible trajectory for 
a polypeptide passing through the tunnel is shown as a white ribbon. The peptidyl transferase site (PT) is also shown. 
Figure 19(B) shows detail of the polypeptide exit tunnel with the distribution of polar and non-polar groups, with atoms 
colored as m Figure 19(A) the constriction in the tunnel formed by proteins L22 and L4 (green patches close to PT). 

is and the relatively wide exit of the tunnel, A modeled polypeptide is in white. Figure 19(C) shows the tunnel surface 
with backbone atoms of the RNA color coded by domain: domain I (white). II (light blue). Ill (gold). IV (green). V (orange). 
5S (pink) and proteins are blue. The peptidyl transferase center (PTC) is shown. Figure 19(D) is a space filling repre- 
sentation of the large subunit surface at the tunnel exit showing the arrangement of proteins, some of which might play 
roles in protein secretion. The RNA is in white (bases) and yellow (backbone) and the numbered proteins are blue. A 

20 modeled polypeptide is exiting the tunnel in red. Figure 19(E) shows a close-up view of the half of the exit tunnel 
showing the relationship of the peptidyl transferase center (PTC) to proteins L4 (yellow) and L22 (blue). The Yarus 
inhibitor and a modeled peptide are purple and the 23S rRNA is in red and white. Figure 19(F) shows a secondary 
structure schematic of 23S rRNA identifying the sequences that contact the tunnel in red. 

[0087] Figure 20 is a picture showing the spatial relationship between the antibiotic anisomycin bound to a large 
25 ribosomal subunit. 

[0088] Figure 21 is a picture showing the spatial relationship between the antibiotic blasticidin bound to a large 
ribosomal subunit. 

[0089] Figure 22 is a picture showing the spatial relationship between the antibiotics carbomycm and tylosin bound 
to a large ribosomal subunit. 

30 [0090] Figure 23 is a picture showing the spatial relationship between the antibiotic sparsomycin bound to a large 
ribosomal subunit. 

[0091] Figure 24 is a picture showing the spatial relationship between the antibiotics virginiamyctn (streptogramm 
A) and carbomycm bound to a large ribosomal subunit. 

[0092] Figure 25 is a picture showing the spatial relationship of certain antibiotics, namely, anisomycin. blasticidin, 
35 carbomycin. and virginiamycin. bound to a large ribosomal subunit. The locations of the bound antibiotics are shown 
relative to the ribosomal A-site. P-site. and polypeptide exit tunnel 

[0093] Figures 26(A)-(C) are pictures showing a peptidyl transferase site disposed within a large ribosomal subunit. 
Figure 26A shows a bound tylosin molecule, and identifies a disaccharide binding pocket and two cavities denoted 
"cavity V and "cavity 2." Figures 26(B) and (C) are provided on the left hand side to orient the reader to the locations 
40 of the peptidyl transferase site (PT) and polypeptide exit tunnel in the large ribosomal subunit 

[0094] Figure 27 is a schematic representation of a computer system useful in molecular modeling a ribosomal 
subunit and/or for performing rational drug design. 

[0095] Figure 28 is a schematic representation ot certain potential drug target sites in a large ribosomal subunit. 
[0096] Figures 29(A)-(D) are pictures showing the residues within the wall ot the polypeptide exit tunnel that are 

**5 conserved (red) or non-conserved (blue) between E. colt and rat. The ribosomal subunit has been sliced down the 
polypeptide exit tunnel witn one half of the polypeptide exit tunnel shown in Figure 29(A), and the other half of the 
polypeptide exil lunnel is shown tn Figure 29(B). Figure 29(C) is provided to orient the reader to show the location of 
the portion of the ribosomal subunit shown in Figure 29(A) relative to the ribosomal subunit as a whole. Figure 29(D) 
is provided to orient the reader to show the location of the portion of the ribosomal subunit shown in Figure 29(B) 

50 relative to the large ribosomal subunit as a whole. 

DETAILED DESCRIPTION OF THE INVENTION 
I. Definitions 

55 

[0097] As used herein, the term "active site" refers to regions on a ribosome or ribosomal subunit that are directly 
involved in protein synthesis, e.g.. the peptidyl transferase site, the elongation factor binding site, and other similar sites 
[0098) As used herein, the terms "agent" and "ligand" are used synonymously and refer to any atom, molecule, or 
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chemical group which binds with a ribosome, ribosomal subunit or ribosome fragment. Thus, ligands include, but are 
not limited to. a single heavy atom, an antibiotic, a tRNA, a peptidyl tRNA. an aminoacyl tRNA. or a signal recognition 
particle ("SRP"). 

[0099] As used herein, "archaebacteria" refers to the kingdom of monerans that includes methane producers, sulf ur- 
5 dependent species, and many species that tolerate very salty or hot environments. 

[0100] As used herein, the term "A-site" refers to the locus occupied by an aminoacyl-tRNA molecule immediately 
prior to its participation in the peptide-bond forming reaction. 

[0101] As used herein, the term "asymmetric unit" refers to a minimal set of atomic co-ordinates that when operated 
upon by the symmetry operations of a crystal will regenerate the entire crystal. 

w [01 02] As used herein, "at least a portion of" or "at least a portion of the three-dimensional structure of" is understood 
to mean a portion of the three-dimensional structure of a ribosome or ribosomal subunit. including charge distribution 
and hydrophilicity/hydrophobicity characteristics, formed by at least three, more preferably at least three to ten, and 
most preferably at least ten contiguous amino acid and/or nucleotide residues of the ribosome or ribosomal subunit. 
The contiguous residues forming such a portion may be residues which form a contiguous portion of the primary se- 

i5 quence of a ribosomal RNA or ribosomal protein, residues which form a contiguous portion of the three-dimensional 
structure of the ribosome or ribosomal subunit. or a combination thereof. Thus, the residues forming a portion of the 
three-dimensional structure need not be contiguous in the primary sequence but. rather, must be contiguous in space. 
As used herein, the residues forming "a portion of the three-dimensional structure of" a ribosome or ribosomal subunit. 
form a contiguous three-dimensional shape in which each atom or functional group forming the portion of the shape 

20 is separated from the nearest atom or functional group forming the portion of the shape by no more than 40 A, preferably 
by no more than 20 A. more preferably by no more than 5-10 A. and most preferably by no more than 1-5 A. 
[01 03] As used herein ; the term "atomic co-ordinates" or "structure co-ordinates" refers to mathematical co-ordinates 
(represented as "X." "Y" and "Z" values) that describe the positions of atoms in a crystal of a ribosome or ribosomal 
subunit. The diffraction data obtained from the crystals arc used to calculate an electron density map of the repeating 

25 unit of the crystal . The electron density maps are used to establish the positions of the individual atoms within a single 
ribosomal subunit. Those of skill in the art understand that a set of structure co-ordinates determined by X-ray crys- 
tallography is not without standard error. For the purpose of this invention, any set of structure co-ordinates for a 
ribosome or ribosomal subunit from any source has a root mean square deviation of non-hydrogen atoms of less than 
0.75 A when superimposed on the non-hydrogen atom positions of the said atomic co-ordinates deposited at the 

30 Research Collaborator for Structural Bioinformatics (RCSB) Protein Data Bank (PDB) (Berman ef al. (2000) Nucleic 
Acids Research 28. 235-242: http://www.rcsb.org/pdb/) with the accession numbers PDB ID: 1 FFK: PDB ID: 1FFZ: 
PDB ID: 1FG0: or PDB ID: 1JJ2. the disclosure of each of the foregoing of which is incorporated herein by reference 
in its entirety. 

[01 04] In the list of atomic co-ordinates deposited at the RCSB Protein Data Bank or included herein as files recorded 

35 on the compact disks, the term "atomic co-ordinate" or structure co-ordinates refer to the measured position of an atom 
in the structure in Protein Data Bank (PDB) format, including X : Y. Z and B. for each. The term "atom type" refers to 
the element whose co-ordinates are measured. The first letter in the column defines the element. The term "X" ; "Y". 
"Z" refers to the crystallographically defined atomic position of the element measured with respect to the chosen crys- 
tallographic origin. The term "B" refers to a thermal factor that measures the mean variation of an atom's position with 

•*o respect to its average position. 

[0105] Reference is made to the sets of atomic co-ordinates and related tables included with this specification and 
submitted on compact disk (six total compact disks including three original compact disks, and a duplicative copy of 
each of the original compact disks), all of the foregoing of which are incorporated by reference herein. Disk No. 1 of 3 
contains eight files: Disk No. 2 of 3 contains four files: and Disk No. 3 of 3 contains nine files. Disk No. 1 of 3 contains 

45 the files identified as PDB1FFK.DOC and PDB1 FFK.ENT which represent files of co-ordinates defining the large ri- 
bosomal subunit: PDB1FFZ.DOC and PDB1FFZ.ENT which represent files of the coco-ordinates defining the large 
ribosomal subunit - CCdA-p-Puro complex; and PDB1FGO.DOC and PDB1 FGO.ENT which represent files of the co- 
ordinates defining the large ribosomal subunit - aa-tRNA analogue complex. Disk No. 2 of 3 contains files identified 
as 1JJ2 RTF and 1JJ2.TXT which represent files of the co-ordinates defining the completely refined large ribosomal 

50 subunit. Disk No. 3 of 3 contains the files identified as anisomycin.pdb, blasticidin.pdb. carbomycin.pdb. sparsomycin. 
pdb. spiramycin. pdb, tylosin.pdb and virginiamycin.pdb which represent files of the co-ordinates defining the large 
ribosomal subunit bound to anisomycin. blasticidin. carbomycin. sparsomycin, spiramycin, tylostn, and virginiamycin : 
respectively. 

[0106] As will be apparent to those of ordinary skill in the art, the atomic structures presented herein are independent 
55 of their orientation, and that the atomic co-ordinates identified herein merely represent one possible orientation of a 
particular large ribosomal subunit. It is apparent, therefore, that the atomic co-ordinates identified herein may be math- 
ematically rotated, translated scaled, or a combination thereof, without changing the relative positions of atoms or 
features of the respective structure. Such mathematical manipulations are intended to be embraced herein. 
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[0107] As used herein, the terms "atomic co-ordinates derived from" and "atoms derived from" refers to atomic co- 
ordinates or atoms derived, either directly or indirectly from an electron density map. It is understood that atomic co- 
ordinates or atoms derived "directly" from an electron density map refers to atomic co-ordinates or atoms that are 
identified from and/or fitted to an electron density map by using conventional crystallographic and/or molecular modeling 

5 techniques and thus can be considered to be primary atomic co-ordinates or atoms. It is understood that atomic co- 
ordinates or atoms derived "indirectly" from an electron density map refers to atomic co-ordinates or atoms that are 
derived from and thus are derivatives or transforms of the primary atomic co-ordinates or atoms and thus can be 
considered to be secondary atomic co-ordinates or atoms. The secondary atomic co-ordinates or atoms may be gen- 
erated from the primary atomic co-ordinates or atoms by using conventional molecular modeling techniques. By way 

w of a non limiting example, the atomic co-ordinates for the H. marismortui large ribosomal subunit as described here- 
inbelow are considered to be primary co-ordinates, whereas the atomic co-ordinates of a mammalian large ribosomal 
subunit which can be derived from H. marismortui atomic co-ordinates by molecular modeling, including, for example, 
homology modeling and/or molecular replacement, are considered to be secondary co-ordinates. Both types of atomic 
co-ordinates and atoms are considered to be embraced by the invention. 

15 [0108] As used herein the terms "bind." "binding." "bound." "bond." or "bonded:' when used in reference to the as- 
sociation ol atoms, molecules, or chemical groups, refer to any physical contact or association of two or more atoms, 
molecules, or chemical groups [e.g.. the binding of a ligand with a ribosomal subunit refers to the physical contact 
between the ligand and the ribosomal subunit). Such contacts and associations include covalent and non-covalent 
types of interactions. 

20 [0109] As used herein, the terms "complex" or "complexed" refer to the assembly of two or moie molecules to yield 
a higher order structure, such as. a SOS ribosomal subunit bound to a ligand. 

[01 10] As used herein, the term "computational chemistry" refers to calculations of the physical and chemical prop- 
erties of the molecules. 

[0111] As used herein, the term "conjugated system" refers to more than two double bonds that arc positioned spa- 
25 tially so that their electrons are completely delocalized with the entire system. Aromatic residues contain conjugated 
double bond systems. 

[0112] As used herein, the terms "covalent bond" or "valence bond" refer to a chemical bond between two atoms in 
a molecule created by the sharing of electrons, usually in pairs, by the bonded atoms 

[0113] As used herein, the term "crystal" refers to any three-dimensional ordered array of molecules that diffracts X- 
30 rays. 

[0114] As used herein, the term "crystallographic origin" refers to a reference point in the unit cell with respect to the 
crystallographic symmetry operation. 

[0115] As used herein, the term "elongation factor binding domain" refers to the region of the ribosome that interacts 
directly with elongation factors, including, for example, the elongation factors, EF-Tu and EF-G. 
35 [0116] As used herein, the term "E-site" refers to the locus occupied by a deacylated tRNA molecule it leaves the 
ribosome following its participation in peptide-bond formation. 

[01 17] As used herein, the term "heavy atom derealization" refers to the method of producing a chemically modified 
form, also known as a "heavy atom derivative", of a crystal of tne ribosome and the ribosomal subunit and its complexes. 
In practice, a crystal is soaked in a solution containing heavy metal atom salts, or organometallic compounds, e.g.. 

40 mercury chlorides, ethyl-mercury phosphate, osmium pentamine. or iridium pentamine. which can diffuse through the 
crystal and bind to the ribosome or ribosomal subunit. The location(s) of the bound heavy metal atom(s) can be de- 
termined by X-ray diffraction analysis of the soaked crystal. This information, in turn, is used to generate the phase 
information used to construct three-dimensional structure of the complex (Blundell et at. (1976) supra). 
[01 1 8] As used herein . the term "homologue" is understood to mean any one or combination of (i) any protein isolated 

45 or isolatable from a ribosome or a ribosomal subunit (i.e.. a ribosomal protein), (ii) any nucleic acid sequence isolated 
or isolatable from a ribosome or ribosomal subunit (i.e. . a ribosomal RN A). (Hi) any protein having at least 25 % sequence 
identity lo a ribosomal protein isolated from E. colioi Rattus notvegicus as determined using the computer program 
"BLAST" version number 2.1 .1 implementing all default parameters, or (iv) any nucleic acid having at least 30% se- 
quence identity to a ribosomal RNA isolated from E. coli or Rattus norvegicus as determined using the computer 

so program "BLAST" version number 2.1.1 implementing all default parameters. "BLAST" version number 2.1.1 is avail- 
able and accessible via the world wide web at http://www/ncbi.nlm.nih.qoy /BLAST/ or can be run locally as a fully 
executable program on a standalone computer. 

[0119] As used herein, the term "homology modeling" refers to the practice of deriving models for three-dimensional 
structures of macromoiecules from existing three-dimensional structures for their homologues. Homology models are 
55 obtained using computer programs that make it possible to alter the identity of residues at positions where the sequence 
of the molecule of interest is not the same as that of the molecule of known structure. 

[0120] As used herein, the term "hydrogen bond" refers to two electronegative atoms (either O or N). which share a 
hydrogen that is covalently bonded to only one atom, while interacting with the other. 
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[0121] As used herein, the term "hydrophobic interaction" refers to interactions made by two hydrophobic residues. 
[0122] As referred to herein, ribosomal proteins are designated "LX" or "SX". where L stands for "large subunit: S 
stands for "small subunit": and X in either case is an integer. 

[0123] As used herein, the term "MIR" refers to multiple isomorphous replacement, a technique used for deriving 
5 phase information from crystals treated with heavy atom compounds. 

[01 24] As used herein, the term "molecular graphics" refers to three-dimensional representations of atoms, preferably 
on a computer screen. 

[0125] As used herein, the terms "molecular model" or "molecular structure" refer to the three-dimensional arrange- 
ment of atoms within a particular object {e.g.. the three-dimensional structure of the atoms that comprise a ribosome 
io or ribosomal subunit. and the atoms that comprise a ligand that interacts with a ribosome or ribosomal subunit. partic- 
ularly with a large ribosomal subunit. more particularly with a 50S ribosomal subunit). 

[0126] As used herein, the term "molecular modeling" refers to a method or procedure that can be performed with 
or without a computer to make one or more models, and, optionally, to make predictions about structure activity rela- 
tionships of ligands. The methods used in molecular modeling range from molecular graphics to computational chem- 
15 istry. 

[0127] As used herein, the term "molecular replacement" refers to a method that involves generating a model of a 
ribosome or ribosomal subunit whose atomic co-ordinates are unknown, by orienting and positioning the atomic co- 
ordinates described in the present invention in the unit cell of the crystals of the unknown ribosome so as best to 
account for the observed diffraction pattern of the unknown crystal. Phases can then be calculated from this model 
20 and combined with the observed amplitudes to give the atomic co-ordinates of the unknown ribosome or ribosomal 
subunit. This type of method is described, for example, in The Molecular Replacement Method. (Rossmann. M.G. ; 
ed.). Gordon & Breach, New York, (1972). 

[0128] As used herein, "noncovalent bond" refers to an interaction between atoms and/or molecules that does not 
involve the formation of a covalcnt bond between them. 
25 [0129] As used herein, the term "peptidyl transferase site" refers to the locus in the large ribosomal subunit where 
peptide bonds are synthesized. 

[01 30] As used herein . the term "polypeptide exit tunnel" refers to the channel that passes through the large ribosomal 
subunit from the peptidyl transferase site to the exterior of the ribosome through which newly synthesized polypeptides 
pass 

30 [0131] As used herein, the term "protein synthesis inhibitor" refers to any molecule that can reduce, inhibit or other- 
wise disrupt protein or polypeptide synthesis in a ribosome. 

[0132] As used herein, the term "P-site" refers to the locus occupied by a peptidyl-tRNA at the time it participates in 
the peptide-bond forming reaction. 

[0133] As used herein, the term "ribofunctional locus" refers to a region of the ribosome or ribosomal subunit that 
35 participates, either actively or passively, in protein or polypeptide synthesis within the ribosome or ribosomal subunit 
and/or export or translocation of a protein or polypeptide out of a ribosome The ribofunctional locus can include, for 
example, a portion of a peptidyl transferase site, an A-site. a P-site. an E-site. an elongation factor binding domain, a 
polypeptide exit tunnel, and a signal recognition particle (SRP) binding domain. It is understood that the ribofunctional 
locus will not only have a certain topology but also a particular surface chemistry defined by atoms that, for example, 
participate tn hydrogen bonding (for example, proton donors and/or acceptors), have specific electrostatic properties 
and/or hydrophilic or hydrophobic character. 

[01 34] As used herein, the term "ribosomal subunit" refers to one of the two subunits of the ribosome that can function 
independently during the initiation phase of protein synthesis but which both together constitute a ribosome. For ex- 
ample, a prokaryotic ribosome comprises a SOS subunit (large subunit) and a 30S subunit (a small subunit). 
**s [0135] As used herein, the term "ribosome" refers to a complex comprising a large ribosomal subunit and small 
ribosomal subunit. 

[01 36] As used herein, the term "signal recognition particle binding domain" refers to the portion of the ribosome that 
interacts directly with the signal recognition particle. 

[0137] As used herein, the term "space group" refers to the arrangement of symmetry elements of a crystal. 
so [0138] As used herein, the term "symmetry operation" refers to an operation in the given space group that places 
the atoms in one asymmetric unit on the corresponding atoms in another asymmetric unit. 

[0139] As used herein, the term "twinned" refers to a single macroscopic crystal that contains microscopic domains 
of the same symmetry that differ significantly in orientation in such a way that the diffraction patterns of all are super- 
imposed. In a twinned crystal the mosaic blocks, or domains, are orientated so that some point in one direction and 
55 others point in a second, distinctly different direction, and the directions are such that the diffraction pattern generated 
by one group of blocks falls exactly on top of the diffraction pattern of the other group. 

[0140] As used herein, the term "untwinned" refers to a crystal cell the domains of which are aligned. The domains 
are also known as the "mosaic blocks." Most crystals diffract as though they were assemblies of mosaic blocks. One 
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can think of them as smalL perfectly ordered regions within the larger crystal, which, overall, is not so well ordered. 
Each block has the same symmetry and unit cell packing as all the others. 

[0141] As used herein, the term "unit cell" refers to a basic parallelepiped shaped block. The entire volume of crystal 
may be constructed by regular assembly of such blocks. Each unit cell comprises a complete representation of the 
unit of pattern, the repetition of which builds up the crystal. 

II. Structure and Use of the Large Ribosomal Subunit 

A. Atomic Structure of the Large Ribosomal Subunit at 2.4 A Resolution, initial Refinement 



[0142] The present invention is based, in part, on the development of a novel method for preparing crystals of ribos- 
omes. The novel method provides crystals of the SOS ribosomal subunit that are much thicker than those available 
earlier and that can diffract X-rays to a resolution of about 2.4 A. The method eliminates the twinning of crystals that 
obstructed progress in determining the crystal structure of the SOS ribosomal subunit from H. marismoriui for many 

is years. The method of preparing the crystals of the SOS ribosomal subunit is discussed below. 

[0143] The present invention is also based, in part, on the atomic structure of the crystal of the SOS ribosomal subunit 
from H. marismoriui 'that has been derived from a 2.4 A resolution electron density map that was experimentally phased 
using heavy atom derivatives. The atomic co-ordinates defining the large ribosomal unit were deposited on July 10, 
2000, al Research Collaboratory for Structural Bioinformalics (RCSB) Protein Data Bank (PDB) (Berman etat. (2000) 

20 Nucleic Acid Research 28, 235-242: http:// www.rcsb.org/pdb/ ) with accession number PDB ID: 1 FFK. 

[01 44] Moreover, the present invention is based, in part, on the derivation from the atomic co-ordinates of the following 
model which is briefly summarized here and discussed in detail in the following sections of the specification. This model 
includes 2.811 of the 2 ; 923 nucleotides of 23S rRNA. all 122 nucleotides of its 5S rRNA. and structures for the 27 
proteins that arc wcll-ordcrcd in the subunit. 

25 [0145] The secondary structures of both 5S and 23S rRNA are remarkably close to those deduced for them by 
phylogenetic comparison. The secondary structure of the 23S rRNA divides it into 6 large domains, each of which has 
a highly asymmetric tertiary structure. The irregularities of their shapes notwithstanding, the domains fit together in an 
interlocking manner to yield a compact mass of RNA that is almost isometric. The proteins are dispersed throughout 
the structure, concentrated largely on its surface, but they are much less abundant in the regions of the subunit that 

30 are of primary functional significance to protein syntheses - the 30S subunit interface, the binding regions for tRNA 
and the peptidyl transferase active site. The most surprising feature of many of these proteins are the extended, irregular 
structures of their loops and termini, which penetrate between RNA helices. The primary role of most of the proteins 
in the subunit appears to be stabilization of the three-dimensional structure of its rRNA. 

35 1. Preparation of the Crystal for the SOS Ribosomal Subunit and Structure Determination. 

[0146] Several experimental approaches were used to extend the resolution of the electron density maps of the H. 
marismoriui SOS ribosomal subunit from 5 A to 2.4 A including improvements in the crystals. A back-extraction proce- 
dure was developed for reproducibly growing crystals that are much thicker than those available earlier and can diffract 

40 to 2.2 A resolution (see Example 1). Briefly, the crystals were grown at room temperature in hanging drops by vapor 
diffusion from seeded solutions back-extracted from precipitated subunits. The crystals that resulted had maximum 
dimensions of 0.5 x 0.5 x 0.2 mm and were harvested after three weeks. The twinning of crystals that obstructed 
progress for many years (Ban et al. (1999) supra) was eliminated by adjusting crystal stabilization conditions (see 
Example 1). Crystals were stabilized by gradual transfer into a solution containing 12%. PEG 6000, 22% ethylene 

45 glycol. 1.7 M NaCl. 0.5 M NH 4 Cl : 100mM potassium acetate. 30 mM MgCI 2 and 1 mM CdCI 2 . pH 6.2. and flash frozen 
in liquid propane. Reducing the salt concentration below 1 .7 M NaCl (KCl) increased the tendency of crystals to become 
twinned. Al salt concentrations as low as 1 .2 M nearly all of the crystals were twinned. 

[0147] All the X-ray data used for high resolution phasing were collected at the Brookhaven National Synchrotron 
Light Source except for two native data sets used, which were collected at the Advanced Photon Source at Argonne 

so (see Example 2) (Table 1). Osmium pentamine (132 sites) and Iridium hexamine (84 sites) derivatives proved to be 
the most effective in producing both isomorphous replacement and anomalous scattering phase information to 3.2 A 
resolution (sec Example 2). Intcr-crystal density averaging which had contributed significantly at lower resolution, was 
not helpful beyond about 5 A resolution. Electron density maps were dramatically improved and their resolutions ex- 
tended, eventually to 2.4 A. using the solvent flipping procedure in CNS (Abrahams et al. (1996) Acta Crystollogr. D 

55 52 : 30: Brunger et al. (199B) Acta Crystaliogr. D Biol. Crystallogr. 54: 905-921). 



BNSDOCID <EP 1188769A2 I > 



17 



EP1 188 769 A2 



Table I 

Statistics tor Data Collection, Phase Determination, and M odel ( '.npstruction 



5 



15 







MIR AS 1 




Data Uatistics 


MIRAS2 








0 V N H * K 




Native") 


IrfN'Hj)^" 


Os(NlIi)* P 




Heavy aiL>ra 




30 0 


0.5 




20 0 


4.5 


3.0 


con;:. (mM; 
















SuAuig time (hts) 




1.5 






2*hr* 


24 hr* 


24hr> 


Si:es no. 




132 


20 




H4 


18 




Resolution (At 


90-2.4 


40-3.5 


40-3.& 


30-2 9 


30-3.1 


30-3.5 


30-3.8 


C) 


(2.5-2 4) 


(3.0-3.5) 


(3 9-3.8) 


(3.0-2.9) (3.32-3.22) 


C- .27-3.20) 


(3.6-3.5) 


(3.97-3.80) 


(A> 


t on 


:.)4 


1.30 


1.00 


1 075 


1 14 


l.?*5 


Observations 


6.0S9.S02 


1.30S.7O3 


J96.i6f. 


2.832J60 


1.823.861 


1.646,468 


1,288,524 


Unique 


665,928 


429.761 


313,863 


39C.770 


54 i, 4X8 


488.275 


346,7^5 


Redun.C) 


9.1 (G.5» 


5.0 (3.S) 


1.9 (1.6) 


72 


3.4 


4.3 (4 2) 


3.7 


Completeness (') 


95.6(71 0) 


99 4(96.K) 


92.0(54.1) 


97.1 


93.8 


98.1(99.0) 


99.5 


U ! (Last bin) 


25.5(1.9) 


13.5(3.3) 


£.9(1.6) 


18.0(6 4) 


12.0(2.6) 


10.6(2.7) 


10.8 02) 




8.6 (69.1) 


7.2(32.0) 


9.1 (37.9) 


11.2(36.9) 


8.5 (29.5) 


12.1 (46.0) 


12.1 (40.5) 


02 (ano) (•) 




2.8(1 0) 


1.5(1.0) 




2.63 (1.48} 


1.8(1.0) 


2.42(1.18) 






62 


8.0 




6.7 


69 








14.1(22.7) 


26.4 (47.0) 




12.9(28.1; 


19.5 (39.4) 





Phasing Statistic* 

R esolution sh e lls (A): -73,200 rcOfCllom pgr bin 

20 30.0 5.1 4.0 3.5 3.2 Total 



25 



30 



MIRASI (KOM) 


0.52 


0.31 


0 14 




0.32 


Os^H,), 1 * 












Phasing power 


0.87 


0.72 


0.66 




0.75 


Phasing power (SAD) 


1 40 


0.58 


0.26 




0.75 


Rcmou (centric) 


0.62 


0.65 


0.67 




0.65 














Phasing Power 


0 47 


0.33 


0.28 




0.36 


Phasing power (SAUJ 


0 46 


0.25 






OJfi 


Rc.m. (centric) 


D.72 


0.77 


0.75 




0.75 


Ml HAS 2 (FOM) 


0.48 


0.40 


0.28 


0.12 


0J3 


lr(NHj), v 












Phasing power 


1.02 


0.92 


0.78 


0.66 


0.85 


Phasing power (SAD) 


202 


1.60 


1.22 


0.R3 


1.47 


R*i.D rt (centric) 


0 58 


0.63 


0 70 


0.74 


0 61 


OMNH,)/* 












Phasing power 


0.62 


0.57 


0.58 


0.58 


0.59 


Phasing powrr (SAD) 


0.47 


0.39 






0.42 


K«„oh (centric) 


0.7& 


0.78 


078 


0?f. 


0.7S 


Ta^Brj/* (Used for SAD phasing only) 












Phasing Power (SAD) 


2.77 


0.35 


0.13 




1.19 


FOM (MKUSPMHHM^SaD) 


0.76 


0.51 


0.31 


0.14 


0.37 







Model Statistic* 


Average 8 factors (A : ) 




Resolution lunge (A) 


50.0-2/. 


rms deviations: 




Reflection* 


577.304 


Bonds (A) 0.0064 


All i'.om« 


37.4 




;$2 


Angifsp :.:9 


21SrRNA 


32J 


K.«c (°''..> 


2M 


Dihedrals C) 28.S 


55 rKNA 


43.2 




Improper* (") 1 .68 


MinmunvMax B factors (A 1 ) 


70/107.9 



40 , wavelength; Redun.. redundancy; (*) last-resolution shell. 

R IW> : ElKpH-hVl/EFpH, where hp M and r> are tUs derivative and the native structure factor amplitudes, respectively. 
R jyTT ,: EI^I^-I^il'SE^hi > where T(h) is ihc mean iniensity after reflections. Phasing power: r.m.s. isomorphous 
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45 isomorphous replacement, anomalous scattering; SAO- single wavelength anomalous diffraction; FOM: figure of 

merit 



50 [0146] Except for regions obscured by disorder, the experimentally-phased, 2.4 A resolution electron density map 
was of sufficient quality so that both protein and nucleic acid sequencing errors could be identified and corrected. Each 
nucleotide could be fitted individually and the difference between A and G was usually clear without reference to the 
chemical sequence, as was the distinction between purines and pyrimidines (Figure 1). 

[0149] Subtraction of the atomic model from the experimental electron density map leaves no significant density 
55 except for water and ions, showing that the model accounts for all the macromolecular density. Preliminary refinement 
of the model was achieved using a mixed target in the program CNS (Brunger et at. (1998) supra). The model was 
further refined in real space against the 2.4 A electron density map using the program TNT (Tronrud (1997). Macro- 
molecular Crystallography. Part B. Methods In Enzymology ), which yielded a model with a free R-f actor of 0.33. One 
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additional round of mixed target refinement of both atomic positions and B-factors using CNS led to the structure 
described below. Its free R-factor is 0.27 (Table 1). 

2. Sequence Fitting and Protein Identification, 

5 

[0150] The sequence of 23S rRNA was fit into the electron density map nucleotide by nucleotide starting from its 
sarcin/ricin loop sequence (A2691-A2702) (E. coli numbers A2654 to A2665). whose position had been determined at 
5 A resolution (Ban et ai (1999) supra). Guided by the information available about the secondary structures of 23S 
rRNAs (Gutell. R.R. (1996). "Comparative Sequence Analysis and the Structure of 16S and 23S rRNA." Ribosomal 

10 RNA. Structure. Evolution. Processing, and Function in Protein Biosynthesis . (Dahlberg A. and Zimmerman B.. eds.) ; 
CRC Press. Boca Raton. FL. pp. 111-128 ). the remaining RNA electron density neatly accommodated the sequence 
of 5S rRNA. The interpretation of protein electron density corresponding to the protein was more complicated because 
each protein region had to be identified chemically before the appropriate sequence could be fit into it, but about 4.000 
amino acid residues were fit into electron density. 

is [0151] The H. marismortui SOS subunit appears to contain thirty-one proteins, and there are sequences in the Swiss- 
Prot data bank tor twenty eight of those thirty one proteins, including one. HMS6 or L7ae. that was originally assigned 
to the small ribosomal subunit (Whittmann-Liebold et af. (1990) supra). The three remaining proteins were identified 
using the sequences of the ribosomal proteins from eukaryotes and other archeal species as guides. No electron 
density was found for one of the H. marismortui large ribosomal subunil proteins in the sequence database. LX. Either 

20 the assignment of LX to the large subunit is in error, or LX is associated with a disordered region of the subunit. or LX 
is absent from the subunits examined altogether 

[0152] The 2.4 A resolution electron density map lacks clear electron density for proteins L1 . L10. L1 and L12. the 
positions of which are known from earlier low resolution X-ray and/or electron microscopic studies. These proteins are 
components of the two lateral protuberances of the subunit. which arc both poorly ordered in those crystals. L1 is the 

25 sole protein component of one of them (Oakes ; M. et at. (1986)). Structure, Function and Genetics of Ribosomes ; 
(Hardesty ; B. and Kramer. G. t eds.) Springer-Verlag : New York. NY. 47-67) and is evident in 9 A resolution density 
maps of the subunit (Ban et at. (1998) supra), but not at higher resolutions. L10. L11 and L12 are components of the 
other protuberance, which is often referred to as the L7/L12 "stalk" (Oakes era/. (1986) supra). L11 and the RNA to 
which it binds were located in the 5 A resolution electron density map of the H. marismortui large subunit (Ban et ai. 

30 (1999) supra) using the independently determined crystal structures of that complex (Conn GL et at. (1999) Science 
284: 1171-1174: Wimberly et ai. (1999) Ce//97: 491-502). A protein fragment (about 100 residues) that is associated 
with the RNA stalk that supports the L1 1 complex can be seen in the 2.4 A resolution map. Based on location, it must 
be part of L1 0. There is no electron density corresponding to L1 2 seen at any resolution, but the L12 tetramer is known 
to be attached to the ribosome through L10. and the L10/L12 assembly is known to be flexible under some circum- 

35 stances (Moller ef ai (1 986) Structure. Function, and Genetics of Ribosomes. supra, pp. 309-325). which may explain 
its invisibility here. 

[0153] The structures of eubacterial homologues of proteins L2. L4. L6. L14. and L22 have previously been deter- 
mined in whole or in part (see. Table 2). L2. L6 and L14 were initially located in the 5 A resolution map (Ban et ai 
(1999) supra). L4 and L22 have now been identified and positioned the same way. Electron density corresponding to 
most of the remaining proteins was assigned by comparing chain lengths and sequence motifs deduced from the 
electron density map with known sequence lengths, guided by the information available about relative protein positions 
(Walleczek et ai (1 988) EMBO J. 7: 3571 -3576) and protein interactions with 23S rRNA and 5S rRNA (Ostergaard et 
ai (1998) J. Moi Bioi 284: 227-240). Each of the protein electron density regions so identified is well accounted for 
by its amino acid sequence. 

^5 [0154] The most interesting of the proteins identified by sequence similarity was L7ae. which first appeared to be 
L30e. The L30e identification seemed plausible because the structure of yeast L30 superimposes neatly on the electron 
density of L7ae. and the slruclure of the RNA lo which L7ae binds closely resembles that of the RNA to which yeast 
L30 binds (Mao. H. et ai (1999) Nat. Struct. Bioi 6: 1139-1147). Nevertheless, the sequence of HMS6. which by 
sequence similarity is a member of the L7ae protein family, fits the electron density better. Four of the other proteins 

50 identified by sequence similarity. L24e. L37e. L37ae. and L44e. contain zinc finger motifs. The rat homologues of L37e 
and L37ae were predicted to be zinc finger proteins on the basis of their sequences (Wool ef ai. (1995) supra), and 
this prediction helped identify their homologues in H. marismortui. 

55 
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The top block of proteins include all those known to have eubacterial homologues of the same name. The second block lists proteins lound in the 
H. marismortui large ribosomal subunit that have only eukaryotic homologues (Wittmann-Liebold et al. (1990) supra). Their names are all followed 

45 by the letter "e" to distinguish them from eubacterial proteins that would otherwise have the same name. The third block are large subunit proteins 

for which no H. marismortui sequence yet exists. They are identified by sequence homology using standard L names. 

1 The structures of all or part of homologues of the following proteins were previously determined: L1 (Nevskaya et al. (2000) Struct. Fold. 
Des. 8: 363), L2 (Nakagawa, A. et al (1999) EMBOJ. 16: 1459-1467), L4 (Wahl etal. (2000) EMBOJ. 19: 807-818). L6 (Golden elal. (1993) EMBO 
J. 12: 4901-4908). L11 (Conn ctai (1999) supra: Wimbcrly ct al. (1999) supra; Markus etal. (1997) Nature Struct. Biol. 4: 70-77), L12 (Lcijonmorck, 
M. etal. (1980) Nature2BS: 824-827). L14 (Davies etal. (1996) Structure A: 55-66). L22 (Unge etal. (1998) Structures: 1577-1586), L30 (Wilson 
et al. (1986) Proc. Nat. Acad. Set. USA 83: 7251-7255). All other structures, except 10. have been newly determined in this study. 

50 o 

* Rat homologue Rat equivalents to H. marismortui proteins are from (Mao et al. (1999) supra). 
^ Sequence chain length. 

4 Conformation: gtb = globular; ext = extension 

5 The protein interactions with the 6 domains of 23S rRNA, 5S rRNA and other proteins are specified. (+) implies that the interaction b substantial. 
(A) implies a weak, tangential interaction. Protein names are shown in parentheses implies that the interactions are weak: otherwise, the interaction 

55 is substantial. 

*Atl entries so designated describe proteins that are not fully represented in the electron density maps described here. The summary information 

provided is derived from literature sources and is included here for completeness only. 

t The structure available for this protein in isolation does not include the extension(s) reported here. 
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Table 2 (continued) 
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The top block ol proteins include all those known to have eubacterial homologues of the same name. The second block lists proteins found in the 
H. mdrismoffw large ribosomal subunil thai have only eukaryotic homologues (Willmann-Liebold et at. (1990) supra). Their names are all followed 
by the letter "e" to distinguish them from eubacterial proteins that would otherwise have the same name The third block are large subunil proteins 

2o for which no H. marismortui sequence yet exists. They are identified by sequence homology using standard L names. 

1 The structures ol all or part of homologues of the following proteins were previously determined: L1 (Nevskaya et at. (2000) Struct. Foid. 
D<?s.8:363).L2(Nakagawa,A.ff/a/ (1999) EMBOJ. 16: 1459-1467), L4 (Wahl et at. (2000) EMBOJ. 19: 807-818). L6 (Golden et at. (1993) EMBO 
J. 12: 4901-4906). L11 (Conn etat. (1999) supra: Wimberry era/. (1999) supra: Ma rkus etai. (1997) Nature Struct. Biol. 4: 70-77). L12 (Leijonmarck. 
M et at. (1980) Nature 286: 824-827). L14 (Davies etai (1996) Structure 4: 55-66). L22 (Unge etai (1998) Structures 1577-1586). L30 (Wilson 
et al. (1986) Proc. Nat. Acad. Sci. USA 83: 7251-7255). All other structures, except 10. have been newly determined in this study. 

^ 5 2 Rat homologue. Rat equivalents to H. marismortui proteins are from (Mao et a I. (1999) supra). 

Sequence chain length. 

4 Conformation, gib = globular; ext = extension 

5 The protein interactions with the 6 domains of 23S rRNA. 5S rRNA and other proteins are specified (*) implies that the interaction is substantial 
(A) implies a weak, tangential interaction. Protein names are shown in parentheses implies that the interactions are weak; otherwise, the interaction 
is substantial 

40 

3. General Appearance of the Subunit. 

[01 55] In its crown view (see Figure 2). the large ribosomal subunit. which is about 250 A across, presents its surface 
that interacts with the small subunit to the viewer with the three projections that radiate Irom that surface pointed up. 

■« Although the protuberance that includes L1 is not visible in the 2.4 A resolution electron density map. the structure of 
L1. which has been determined independently (Nikonov etai. (1996) EMBOJ. 15: 1350-1359), has been positioned 
approximately in lower resolution maps (Ban etai. (1998) supra) and is included hereto orient the reader. It is evident 
that, except lor its two lateral protuberances, the large ribosomal subunil is monolithic. There is no hint or a division of 
its sttucture into topological^/ separate domains. In addition, partly because it lacks obvious domain substructure but 

50 also because it is so large it is impossible to comprehend looking at it as a whole. In order to convey to the reader a 
sense of how it is put together, the subunit must be dissected into its chemical components. 

4. RNA Secondary Structure. 

55 [01 56] All the base pairs in H. marismortui 23S rRNA stabilized by at least two hydrogen bonds were identified using 
a computer program that searched the structure for hydrogen bond donors and acceptors separated by less than 3.2 
A. Bases linked by at least two such bonds were considered paired if the angle between their normals was less than 
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45°. and the angle between bonds and base normals was also less than 45°. Based on the results of this analysis, a 
secondary structure diagram has been prepared in the format standard for23S/28S rRNAs (see Figure 3). The sec- 
ondary structure predicted for this molecule by phylogenetic comparison was remarkably accurate, but it did not find 
all of the tertiary pairings and failed to identify interactions involving conserved bases. In addition to base pairs of nearly 
5 every type, the RNA contains numerous examples of well-known secondary structure motifs such as base triplets, 
tetraloops. and cross-strand purine stacks, but no dramatically new secondary structure motifs have been identified 
so far. 

[0157] The secondary structure of this 23S rRNA consists of a central loop that is closed by a terminal stem, from 
which 11 more or less complicated stem/loops radiate. It is customary to describe the molecule as consisting of 6 
10 domains, and to number its helical stems sequentially starting from the 5' end (see Figure 4) (Leffers et aL (1987) 
supra). The division of the molecule into domains shown in Figure 4 deviates from standard practice with respect to 
helix 25. which is usually considered to be part of domain I. It is placed in domain II because it interacts more strongly 
with domain II than it does with the other elements of domain I. 

[0158] There are five sequences longer than 10 nucleotides in 23S rRNA whose structures cannot be determined 
* 5 from the 2.4 A resolution map due to disorder. Altogether they account for 207 out of the 232 nucleotides missing from 
the final model. The disordered regions are: (1) all of helix 1. (2) the distal end of helix 38. (3) helix 43/44 to which 
ribosomal protein L11 binds. (4) the loop end of stem/loop 69. and (5) helix 76/77/78. which is the RNA structure to 
which L1 binds. For completeness, these regions are included in Figure 3 (in gray) with their secondary structures 
determined phylogenelically. 

20 

5. Overall Architecture of rRNA. 

[0159] The six domains of 23S rRNA and 5S rRNA all have complicated, convoluted shapes that nevertheless fit 
together to produce a compact, monolithic RNA mass (sec Figure 4(A) and 4(B)). Thus despite the organization of its 

25 RNAs at the secondary structure level, in three-dimensions, the large subunit is a single, gigantic domain. In this 
respect, it is quite different from the small subunit ; which is a flatter object that is not at all monolithic. Even in low 
resolution electron micrographs the small subunit consists of three structural domains, each of which., it turns out 
contains one of the three secondary structure domain of its RNA (Noller et at. (1 990) The Ribosome: Structure. Function, 
and Evolution , supra, pp. 73-92). This qualitative difference between the two subunits may reflect a requirement for 

30 conformational flexibility that is greater for the small subunit. 

[0160] Domain I. which looks like a mushroom (see Figure 4(E)). lies in the back of the particle, behind and below 
the L1 region. The thin part of the domain starts in the vicinity of domain VI. which is where its first and last residues 
are located. Helices 1 and 25 span the particle in the back and then the domain expands into a larger more globular 
structure below and behind the L1 region. 

35 [0161] 'The largest of the six 23S rRNA domains, domain I!, which accounts for most of the back of the particle, has 
three protrusions that reach towards the subunit interface side of the particle (see Figure 4(F)). One of them (helix 42 
- 44) is the RNA portion of the L7/L12 stalk, which is known to interact with elongation factors, is not well-ordered in 
these crystals The second domain II protrusion is helix 38, which is the longest, unbranched stem in the particle. It 
starts in the back of the particle, bends by about 90 degrees and protrudes towards the small subunit between domains 

-to V and 5S rRNA. The third region ; helix 32-35.1 , points directly towards the small subunit and its terminus ; the loop of 
stem/loop 34. interacts directly with the small ribosomal subunit (Culver et al. (1999) Science 285: 2133-2135). This 
loop emerges at the subunit interface between domains III and IV. 

[0162] Domain III is a compact globular domain that occupies the bottom left region of the subunit in the crown view 
(see Figure 4(G)). It looks like a four pointed star with the origin of the domain (stem/loop 48) and stem/loops 52, 57, 
•*5 and 58 forming the points. The most extensive contacts of domain III are with domain II. but it also interacts with 
domains I. IV and VI. Unlike all the other domains, domain III hardly interacts with domain V at all; the sole contact is 
a single van der Waals contact involving a single base from each domain. 

[0163] Domain IV accounts for most of the interface surface of the SOS subunit that contacts the 30S subunit (see 
Figure 4(H)). It forms a large diagonal patch of flat surface on that side of the subunit. and connects to domains III and 

50 v in the back of the particle. Helices 67-71 are the most prominent feature of domain IV. and form the front rim of the 
active site cleft, which is clearly visible at low resolution (see Figure 2). This is one of the few regions of the 23S rRNA 
that is not extensively stabilized by ribosomal proteins. Helix 69 in the middle of this ridge interacts with the long 
penultimate stem of 1 6S rRNA in the small ribosomal subunit and can be viewed as a divider separating A-site bound 
tRNAs from P-site bound tRNAs 

55 [0164] Domain V. which is sandwiched between domains IV and II in the middle of the subunit. is known to be inti- 
mately involved in the peptidyl transferase activity of the ribosome. Structurally the domain can be divided into three 
regions (see Figures 4(l) and 4(J)). The first starts with helix 75 and ultimately forms the binding site for protein L1 . 
The second, which consists of helices 80-88, forms the bulk of the central protuberance region, and is supported in 
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the back by the 5S rRNA and domain II. The third region, which includes helices 89-93. extends towards domain VI 
and helps stabilize the elongation factor binding region of the ribosome. 

[0165] The smallest domain in 23S rRNA. domain VI. which forms a large part of the surface of the subunit imme- 
diately below the L7/L1 2 stalk, resembles a letter X with a horizontal bar at the bottom (see Figure 4(K)). An interesting 

5 region of this domain is the sarcin-ricin loop (SRL) (stem/loop 95). the structure of which has been extensively studied 
in isolation (Szewczak et ai (1995) d Mol. Biol. 247: 81-98). The SRL is essential for factor binding, and ribosomes 
can be inactivated by the cleavage of single covalent bonds in this loop (Wool et ai (1992) TIBS 17: 266-269). As 
suggested by nucleotide protection data, the major groove of this loop is exposed to solvent (Moazed et ai (19B8) 
Nature 334: 362-364) : and its conformation is stabilized by proteins and through interaction with domain V that involves 

10 two bases on the minor grove side. The nucleotides involved are A 2699 and G 2700 in domain VI. and A 2566 and 
G 2567 in domain V. 

[0166] 5S ribosomal RNA. which is effectively the seventh RNA domain in the subunit. consists of three stems radi- 
ating out from a common junction called loop A (see Figure 4(D)). In contrast to what is seen in the crystal structure 
of fragment 1 from E. coli 5S rRNA (Correll etai (1997) Ce//91 : 705-712). the helix 2/3 arm of the molecule stacks on 
is its helix 4/5 arm, not helix 1 (see Figure 4(L)) . This arrangement results from a contorted conformation of loop A residues 
that involves two stacked base triples. Indeed, from the secondary structure point of view, the loopA-helix 2.3 arm of 
5S rRNA is quite remarkable. Nowhere else in the subunit is there a higher concentration of unusual pairings and of 
convoluted RNA secondary structure. 

20 6. Sequence Conservation and interactions in 23S rRNA. 

[01 67] While 23S/28S rRNAs contain many conserved sequences, they also vary substantially in chain length. Short- 
er 23S/28S rRNAs are distinguished from their longer homologues by the truncation of. or even the elimination of entire 
stem/loops, and by comparing sequences, one can identify a minimal structure that is shared by all (Gcrbi (1995) 

25 Ribosomal RNA: Structure. Evolution. Processing and Function in Protein Biosynthesis, supra, pp. 77-88). The expan- 
sion sequences in the 23S rRNA of H. mahsmortui, i.e., the sequences it contains that are larger than the minimum, 
are shown in Figure 5 in green. They are largely absent from the subunit interface surface of the particle, but they are 
abundant on its back surface, far from its active sites. This is consistent with low resolution electron microscopic ob- 
servations suggesting that the region of the large subunit whose structure is most conserved is the surface that interacts 

30 with the small subunit (Dube etai (1998) Structures: 389-399). 

[0168] There are two classes of conserved sequences in 23S rRNA. One contains residues concentrated in the 
active site regions of the large subunit. The second class consists of much shorter sequences scattered throughout 
the particle (Figure 5: red sequences). The SRL sequence in domain VI and the cluster of conserved residues belonging 
to domain V that are located at the bottom of the peptidyl transferase cleft are members of the first class. They are 

35 conserved because they are essential for substrate binding, factor binding and catalytic activity. Most of the residues 
in the second class of conserved residues are involved in the inter- and intra-domain interactions that stabilize the 
tertiary structure of 23S rRNA. Adenosines are disproportionately represented in this class. The predominance of A's 
among the conserved residues in rRNAs has been pointed out in the past (Ware et ai (1983) Nuci Acids. Res. 22: 
7795-7817). 

-to [0169] In addition to its reliance on A-dependent motifs, the tertiary structure of the domains of 23S rRNA and their 
relative positions are stabilized by familiar tertiary structure elements like RNA zippers and tetraloop/tetraloop receptor 
motifs (Moore. P.B. (1999) Annu. Rev. Biochem. 68 287-300). and in many places, base pairs and triples stabilize the 
interactions of sequences belonging to different components of the secondary structure of 23S rRNA. 
[0170] Interestingly. 5S rRNA and 23S rRNA do not interact extensively with each other. The few RNA/RNA interac- 
ts tions there are involve the backbones of the helix 4/5 arm of 55 rRNA and the backbone of helix 38 of 23S rRNA. Most 
of the free energy and all of the specificity of 5S rRNA binding to the large ribosomal subunit appears to depend on its 
extensive interactions wilh proteins thai act as modeling clay slicking il to the resl of ribosome. 

7. Proteins in the SOS Ribosomal Subunit. 

50 

[0171] The structures of twenty seven proteins found in the large ribosomal subunit of H. mahsmortui (Table 2) have 
been determined. Twcnty-onc of these protein structures have not been previously established for any homologues, 
and the structures of the six that do have homologues of known structure have been rebuilt into the electron density 
map using their H. mahsmortui sequences. In addition, there are structures available for homologues of H. mahsmortui 
55 L1. L11 and L12. which cannot be visualized in the 2.4 A resolution electron density map. Only the structure of L10 is 
still unknown among the thirty one proteins known to be present. 

[01 72] Not every one of these structures is complete. For example, an entire domain of L5 is missing from the electron 
density, presumably because of disorder. L32e is also noteworthy in this regard. About twenty residues from its N- 
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terminus are not seen in the electron density map, and the electron density map suggests that its C-terminal residue 
is covalently bound to the most N-terminal of its visible residues. 

[0173] Of the thirty large subunit ribosomal proteins whose structures are known. 17 are globular proteins similar 
in character to thousands whose structures are in the Protein Data Bank (Table 2). The remaining thirteen proteins 

5 either have globular bodies with extensions protruding from them fglb+ext") or are entirely extended ("ext"). Their 
extensions often lack obvious tertiary structure and in many regions are devoid of secondary structure as well (see 
Figure 6). These extensions may explain why many ribosomal proteins have resisted crystallization in isolation. The 
exceptions that prove the rule are L2 and L4, both of which are proteins belonging to the "glb+ext" class. Protein L2 
was crystallized and its structure solved only after its extensions had been removed (Nakagawa et al. (1999) supra). 

10 and one of the two regions of L4 that are extended in the ribosome is disordered in the crystal structure of intact L4 
(Wahl et ai (2000) supra). 

[0174] Except for proteins L1 , L7. L10 and L11 , which form the tips of the two lateral protuberances, the proteins of 
the SOS subunit do not extend significantly beyond the envelope defined by the RNA (see Figure 7). Their globular 
domains are found largely on the particle's exterior often nestled in the gaps and crevices formed by the folding of the 

T5 RNA. Thus, unlike the proteins in spherical viruses, the proteins of the large ribosomal subunit do not form a shell 
around the nucleic acid with which they associate, and unlike the proteins in nucleosomes, they do not become sur- 
rounded by nucleic acid either. Instead the proteins act like mortar filling the gaps and cracks between "RNA bricks". 
[0175] The distribution of proteins on the subunit surface is nearly uniform, except for the active site cleft and the 
flat surface that interacts with the 30S subunit. In the crown view Ihe proteins lie around al the periphery of the subunil 

20 (see Figure 7(A)). but when viewed from the side opposite the 30S subunit binding site (the "back side"), they appear 
to form an almost uniform lattice over its entire surface (see Figure 7(B)). Similarly, the bottom surface of the subunit. 
which includes the exit of polypeptide tunnel, is studded with proteins (see Figure 7(C)). Indeed, the 5 proteins that 
surround the tunnel exit may play a role in protein secretion since they are part of the surface that faces the membrane 
and the translocon when membrane and secreted proteins arc being synthesized. 

25 [0176] Although Figure 7 shows protein chains disappearing into the ribosome interior the degree to which proteins 
penetrate the body of the particle can only be fully appreciated when the RNA is stripped away. The interior of the 
particle is not protein-free, but it is protein-poor compared to the surface of the particle. Extended tentacles of polypep- 
tide, many of which emanate from globular domains on the surface, penetrate into the interior, filling the gaps between 
neighboring elements of RNA secondary structure (see Figure 8(E)). The bizarre structures of these extensions are 

30 explained by their interactions with RNA. 

[0177] Although extended, non-globular structures are rare in the protein data base, they are not unknown. Extended 
protein termini often form inter-protein contacts, e.g.. in viral capsids. presumably adopting fixed structures only upon 
capsid formation. The basic "tails" of histones may behave the same way when nucleosomes form. The N-terminal 
sequences of capsid proteins are often positively charged, and in virus crystal structures, the electron density for these 

35 sequences often disappears into the interior of the virus where these sequences presumably interact with asymmet- 
rically arranged nucleic acid. The interactions observed in the ribosome could be useful models for these viral inter- 
actions. 

[0178] The interactions of extended polypeptides and RNA in the large subunit. which stabilizes its massive nucleic 
acid structure, result in a tangle of RNA and protein in the center of the subunit (see Figures 8(A) and 8(B)). It is hard 
•to to imagine such an object assembling from its components efficiently in anything other than a highly ordered manner. 
Chaperones may well be required to prevent the aggregation of the extended regions of these proteins, which are likely 
to be disordered outside the context provided by rRNA. and to manage the folding of rRNA. 

8. Protein and RNA Interactions. 

45 

[0179] Because protein permeates the large subunit to a surprising degree, there are only a few segments of the 
23S (RNA that do not interact with protein al all. Of the 2923 nucleotides in 23S rRNA. 1157 nucleotides make al least 
van der Waals contact with protein (see Figure 8(D)). and there are only ten sequences longer than twenty nucleotides 
in which no nucleotide contacts protein. The longest such sequence contains forty-seven nucleotides, and is the part 

so of domain IV that forms the ridge of the active site cleft. 

[0180] The extent of the interactions between RNA and protein that occur when the targe subunit assembles can 
estimated quantitatively. Using the Richards algorithm (Lee. B. ct al. (1971) J. Mot. Biol. 55: 379-400) and a 1.7 A 
radius probe to compute accessible surface areas, it can be shown that 180.000 A 2 of surface becomes buried when 
the subunit forms from its isolated, but fully structured components. This is about half their total surface area. The 

55 average is about 6.000 A 2 per protein. While this is an enormous amount compared to the surface buried when most 
protein oligomers form, it should be recognized that ribosome assembly must be accompanied by a large loss in con- 
formational entropy that does not occur when most proteins oligomerize. The extended protein termini and loops of 
the ribosomal proteins are almost certainly flexible in isolation, and in the absence of protein, the RNA is probably quite 
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flexible as well. Thus, the burial of a large amount of surface area may be required to provide the energy necessary 
to fix of the structures of these molecules. 

[0181] All of the proteins in the particle except L12. interact directly with RNA and all but seven of the remaining thirty 
proteins interact with two rRNA domains or more (Table 2). The "champion" in this regard is L22. which is the only 

5 protein that interacts with RNA sequences belonging to all 6 domains of the 23S rRNA (see Figure 8(C)). The protein- 
mediated interactions between 5S rRNA and 23S rRNA are particularly extensive. Protein L18 attaches helix 1 and 
helix 2/3 of 5S rRNA to helix 87 of 23S rRNA. Protein L31 e mediates an interaction between the same part of 5S rRNA 
and domains II and V. Loop C is linked to domain V by protein L5 and loop D is attached to domains II and V by protein 
L1 Oe. Whatever else they may do. it is evident that an important function of these proteins is stabilization of the relative 

w orientations of adjacent RNA domains. Several also help secure the tertiary structures of the domains with which they 
interact. 

[01 82] Since most ribosomal proteins interact with many RNA sequences and the number of proteins greatly exceeds 
the number of RNA domains, it can hardly come as a surprise that every rRNA domain interacts with multiple proteins 
(Table 2). Domain V. for example, interacts with thirteen proteins, some intimately and a few in passing. 

is [0183] It is clear that the oligonucleotide binding experiments long relied on for information about the RNA binding 
properties of ribosomal proteins have underestimated their potential for interacting with RNA. The high-affinity RNA 
binding site identified on a protein by such an experiment may indeed be important for ribosome assembly, but its 
many, weaker interactions with otnersequences are likely to be missed : and they too may be vital for ribosome structure. 
Most ribosomal proteins crosslink RNA and crosslinking is impossible wilhoul multiple interactions. Similar consider- 

20 ations may apply to proteins that are components of other ribonucteoproteins such as the sliceosome. 

[0184] Of the seven proteins that interact with only one domain, three (L1 , L1 0, L1 1 ) participate directly in the protein 
synthesis process. Rather than being included in the ribosome to ensure that the RNA adopts the properconformation. 
it seems more appropriate to view the RNA as being structured to ensure their correct placement. Another three (L24 : 
L29. L18c) interact with several secondary structure elements within the domains to which they bind ; and presumably 

25 function to stabilize the tertiary structures of their domains. The last of the single RNA domain proteins. L7ae ; is puzzling. 
On the one hand, it cannot function as an RNA stabilizing protein because it interacts with only a single, short sequence 
in domain I. and on the other hand, it is far from the important functional sites in the subunit, the peptidyl transferase 
site and factor binding site It is quite close to L1 , however, which appears to be important for E-site function (Agrawal 
ef at. (1999) J. Biol. Chem 274: 8723-8729), and maybe it is involved in that activity. 

30 [0185] While many ribosomal proteins interact primarily with RNA. a few interact significantly with other proteins. 
The most striking structure generated by protein-protein interactions is the protein cluster composed of L3. L6. L14. 
L1 9 and L24e that is found close to the factor binding site. The protein surface they provide may be important for factor 
interactions. 

[0186] The structure presented above illuminates both the strengths and weaknesses of approaches to complex 
35 assemblies that depend on determining the structures of components in isolation. The structures of the globular do- 
mains of homologues of the proteins in large ribosomal subunit from H. marismortui are largely the same as those of 
the corresponding domains in the intact subunit. though adjustments in domain positions are sometimes required. 
Consequently, these structures were very useful for locating proteins and interpreting lower resolution electron density 
maps. However, for obvious reasons, the structures of the extended tails and loops of ribosomal proteins cannot be 
40 determined in the absence of the RNAs that give them structure, and the feasibility of strategies that depend on pro- 
ducing low molecular weight RNA-protein complexes that have all the RNA contacts required to fix the structures of 
such proteins seems remote. RNA is also a problem While the sarcin/ricin loop has much the same structure in isolation 
as it does in the ribosome, the structure of 5S rRNA in isolation differs in some respects from what is seen in the 
ribosome. and the structure of the isolated P-loop (Puglisi ef at. (1 997) Nat Struct. Biol. 4: 775-778) does not resemble 
■w the structure of the P-loop in the ribosome at all. Clearly a "structural genomics" approach to the ribosome. which would 
have entailed determining the structures of all its proteins and all possible rRNA fragments, would not have worked. 
It may not be successful (or other macromolecuiar assemblies either. 

B. The Structural Basis of Ribosome Activity in Peptide Bond Synthesis 

50 

[01 87] Analysis of the atomic co-ordinates discussed in section I IA above together with additional atomic co-ordinates 
of a ribosomal subunit complcxcd with various analogues, similarly refined, permit an analysis of ribosome function. 
Accordingly, the present invention is also based on the crystals of Haloarcula marismortui 60S ribosomal subunit com- 
plexed either with the Yarus transition state analogue. CCdA-p-Puro. or with a mini-helix analogue of an aminoacyl- 
55 tRNA. The present invention provides the structures of both complexes. The atomic co-ordinates of the structure of 
both complexes were deposited on July 26, 2000 at Research Collaborator/ for Structural Bioinformatics (RCSB) 
Protein Data Bank (PDB) (Berman ei at. (2000) Nucleic Acid Research 28: 235-242: http:// www.rcsb.org/pdb/) with 
accession numbers PDB ID: 1 FFZ (SOS ribosome/ CCdA-p-Puro complex) and PDB ID: 1 FG0 (SOS ribosome/aa-tRNA 
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analogue, 

[0188] As discussed below, the complete atomic structures of the large ribosomal subunit and its complexes with 
two substrate analogues show that the ribosome is a ribozyme. The complete atomic structures also provide information 
regarding the catalytic properties of its all-RNA active site. Both substrate analogues are contacted exclusively by 

5 conserved rRNA residues from domain V of 23S rRNA: there are no protein side-chains closer than about 1 8 A to the 
peptide bond being synthesized. The mechanism of peptide bond synthesis appears to resemble the reverse of the 
deacylation step in serine proteases, with the base of A2486 (A2451 ) in £ coli playing the same general base role as 
His57 in chymotrypsin. The unusual pKa that A2486 must possess to perform this function probably derives from its 
hydrogen bonding to G2482 (G2447) which interacts with a buried phosphate that could stabilize the unusual tautomers 

10 of two bases which is required for catalysis. The polypeptide exit tunnel is largely formed by RNA but has significant 
contributions from proteins L22. L39 and L4 and its exit is encircled by proteins L19. L22. L23a. L24 and L29. 
[0189] The CCdA from the Varus analogue binds to the so-called P-loop and hence must be in the P-site. Only the 
terminal-CCA of the aa-tRNA analogue is visible, but since it interacts appropriately with the A-loop (Kim et ai (1999) 
Molec. Cell A: 859-864) ; it must be in the A-site. The puromycin group occupies the same location in both structures. 

* 5 and there are no proteins near that site. Hence ; the catalytic activity of the active site must depend entirely on RNA. 
The N3 of A2486 (£. coli A2451 ) is the titratable group nearest to the peptide bond being synthesized and is likely 
functioning as a general base to facilitate the nucleophilic attack by the - amino group of the A-site substrate. In order 
to function in this capacity, the pKa of this base has to be roughly 5 units higher than normal. 

20 1. Structures of Substrate Analogue Complexes. 

[0190] In order to establish how substrates interact at the A-site and P-site of the large subunit. two substrate ana- 
logues were used. One of the analogues, which was designed to mimic the acceptor stem of an aa-tRNA and bind to 
the A-sitc. was a twelve base-pair RNA hairpin with an aminoacylatcd : four-nuclcotidc extension on its 3' end (sec 
25 Figure 9). The sequence used was that of the tRNA tyr acceptor stem and it is terminated with puromycin, which itself 
is an analogue of tyrosyl-A76. The second analogue used was the Yarus transition state analogue. CCdA-p-puromycin. 
As in the case of the A-site substrate analogue, the puromycin of the Yarus inhibitor is expected to bind at the A-site, 
while its CCdA moiety should bind at the P-site. 

[0191] The positions of the Yarus inhibitor and the tRNA acceptor stem analogue were determined by soaking these 
30 molecules into crystals of the H. marismortui SOS ribosomal subunit. measuring diffraction data to 3.2 A resolution and 
calculating difference electron density maps (Welch et ai (1997) Biochemistry 36: 6614-6623). Maps of the complexes 
were also calculated using 2F 0 (complexed)-F 0 (uncomplexed) as coefficients, to examine the shifts in the positions of 
ribosome residues that occur when these analogues bind (see Figure 10(B) and Table 3). 

35 Table 3. 



Statistics for Data Collection and Scaling. 


Crystal 


Native A 


Native B 


CcdAp-Puro 


Mini-helix 


Soak time (hours) 






24 


24 


Soak 

concentration (uM) 






100 


100 


Wavelength (A) 


1 0 


1.0 


1.0 


1.0 


Observations 


1.571.171 


1,344.877 


2.590,726 


2,712.813 


Unique 


284.533 


369.167 


367.284 


447,121 


Redundancy 


5.5 


3.6 


7.0 


6.0 


Resolution limits 

(A) 


70.0-3.2 


70.0-3.0 


70.0-3.0 


70.0-2.8 


(High- 
resolution bin)* 


(3.26-3.20) 


(3.05-3.00) 


(3.23-3.17) 


(3.08-3.02) 


Completeness 


94.1 (96.0) 


98.9 (99.3) 


98.6 (99.9) 


99.6 (100) 



55 'Statistics in parenthesis are calculated for the high-resolution bin used in map calculations which, as indicated was sometimes lower in resolution 

than the high-resolution bin used in data reduction, t "merge : -^"(h) * '(hji'^^hji' where '(h) is the mean intensitv a ,ter reflection. tR iso : -IFp H 
- Fp l-PpH where Fp H and Fp are the soaked and the native crystal structure factor amplitudes respectively. 
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Tabic 3. (continued) 



Statistics for Data Collection and Scaling. 


Crystal 


Native A 


Native B 


CcdAp-Puro 


Mini-helix 


1/1 


14.6 (4.0) 


10.8 (3.1) 


11.0 (2.8) 


10.7 (2.9) 


^merget 


10.2 (40) 


11.5(38) 


18.8 (84) 


14.3 (72) 


R iso Native At 






6.8 (20.8) 


14 4 (25.2) 


fl iso Native B* 






12.6 (27.4) 


17.5 (31.0) 



[0192] A model for the entire Yarus inhibitor could be fitted into the difference density (see Figure 10(A)). and the 
electron density map of the complex shows the N3 of A2486 (2451 ) within hydrogen bonding distance of a non-bridging 
oxygen of the phosphoramide (see Figure 1 0(B)). The inhibitor's two C's. which correspond to C74 and C75 of peptidyl- 

'5 tRNA, are Watson-Crick base-paired with G2285 (2252) and G2284 (2251 )in the P-loop. respectively (see Figure 11 
(A)). The C74-G2285 (2252) interaction was predicted by the results of Noller and coworkers (Nolier el al. (1992) 
Science 256: 1416-1419). The dA . which corresponds to A76 of a tRNA in the P-site. is not base-paired, but rather 
stacks on the ribose of A2486 and hydrogen bonds to the 2'OH of nucleotide A2485 (see Figure 12(B)). 
[0193] Only the CC-puromycin moiety of the mini-helix acceptor stem analogue showed ordered electron density m 

20 its difference electron density map (see Figure 1 0(C)). The C75 of the acceptor stem CCA is Watson-Crick base-paired 
with G2588 (2553) of the A-loop. whereas the C74 is more disordered and not base-paired but appears to stack on a 
ribosome base. The dimethyl A of the A-sile inhibitor puromycin is posilioned identically to the dimethyl A of the Yarus 
inhibitor. Further, the dimethyl A of puromycin, which is the A76 equivalent of an A-site tRNA, interacts with the A-loop 
in much the same way that the A76 from the P-site CCA interacts with the P-loop (see Figure 11 (B)). 

25 [0194] The most notable of the several conformational changes in the ribosome induced by the binding of the tran- 
sition state analogue is the ordering of base A2637 (2602). which is disordered in the unliganded enzyme (see Figure 
11(B)). It becomes positioned between the CCA bound at the A-sitc and the CCA bound at the P-sitc. The base of 
U2620 (2585) also moves so that it can make a hydrogen bond with the 2' hydroxyl of the ribose of A76 in the A-site. 
and U261 9 and G261 8 shift to allow the A76 to be positioned. Smaller shifts in positions are observed in the positions 

30 of A2486. whose N3 is near to the non-bridging oxygen of the phosphate, and one of the G residues wtth which it 
interacts. G2102 (2482). 

2. Location and Chemical Composition of the Peptidyl Transferase Site. 

35 [01 95] The inhibitors are bound to a site made entirely of 23S rRNA with no proteins nearby, proving that the ribosome 
is a ribozyme. Both the Yarus inhibitor and the A-site analogue of aa-tRNA bind to the large subunit at the bottom of 
a large and deep cleft at the entrance to the 100 A long polypeptide exit tunnel that runs through to the back of the 
subunit (see Figure 12). This site is surrounded by nucleotides belonging to the central loop of 23S RNA domain V, 
the "peptidyl transferase loop " Nucleotides from the single stranded portions of this loop make the closest approach 

4 ° to the phosphate that mimics the tetrahedral carbon intermediate. In general, the helices that extend from the peptidyl 
transferase loop in 2 secondary structure diagrams of 23S rRNA also extend away from the active site in the tertiary 
structure (see Figure 13). Although there are 13 proteins that interact with domain V (see Figure 14(A)). there are no 
globular proteins in the vicinity of the inhibitor. The closest polypeptides are the non-globular extensions of several 
proteins (L2. L3. L4. L10e) that penetrate deeply into domain V and approach the active site (see Figure 14(B)). These 

45 extensions fill many of the voids between the RNA helices of domain V. neutralize phosphate backbone charge, and 
presumably stabilize the structure of the domain and its association with other RNA regions. However, none of their 
side chain atoms is closer than about 18 A to the phosphorus of the inhibitor's phosphate group, which marks the site 
where peptide bonds form. Furthermore, the substrate analogue is completely enclosed in an rRNA cavity that is so 
tightly packed that there is no possibility that an unidentified peptide could be lurking nearby (see Figure 15). Thus, 

50 the catalytic entity in the ribosome must be RNA. 

[0196] Two of the proteins with long termini or loops penetrating the rRNA scaffold of domain V are proteins that 
could not previously be excluded from involvement in the peptidyl transferase reaction L2 and L3 (Noller (1991) Ann. 
Rev. Biochem. 60: 191-227). Noller and colleagues (Noller et al. (1992) supra) found that under conditions which 
prevent RNA denaturation. extensive digestion of Thcrmus thcrmophitus 50S subunits with proteases followed by 

55 extraction with phenol and other agents that disrupt protcin-RNA interactions did not remove several peptides from 
the subunit that were less than 10,000 in molecular weight. The structure makes it clear why these protein fragments 
were particularly resistant to protease treatments. While protease treatment could digest the globular protein domains 
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on the surface of the large subunit. it could not remove the long termini or loops that penetrate deeply in the 23S rRNA 
because they are sequestered within the rRNA and thus protected from cleavage, independently of the globular do- 
mains. 

5 3. Peptidyl Transferase Active Site. 

[0197] The RNA that surrounds the substrate analogues is closely packed, much like the active site region of a 
protein enzyme and the nucleotides in contact with the inhibitor are greater than 95% conserved in all three kingdoms 
of life (see Figure 15). Thus, it is clear that the ribosome is a ribozyme, but what gives the RNA its catalytic power? 

w [0198] Without wishing to be bound by theory, the residue most likely to be involved in catalysis, presumably as a 
general base, is A2486, whose N3 is about 3 A from the phosphoramide oxygen of the Yarus inhibitor that is the 
analogue of the carbonyl oxygen of a nascent peptide bond and about 4 A from the amide that corresponds to the 
amide nitrogen of the peptide bond being synthesized. Ordinarily, the pKa of the N1 of adenosine monophosphate is 
about 3.5 and that of its N3 is perhaps 2 pH units lower (Saenger (1984) Principles of Nucleic Acid Structure, (C.R. 

'5 Cantor, eds.). Springer Advanced Texts in Chemistry. Springer-Verlage. New York, NY), and in order for A2486 to 
function as a general base, its pKa would have to be raised to 7 or higher. The crystal structure itself suggests that its 
pKa is. in fact, quite unusual. The N3 of A2486 can only hydrogen bond to the phosphate oxygen, as observed, if it 
(or the phosphate oxygen) is protonated. The distance between these two atoms is about 3 A indicating that a hydrogen 
bond does, indeed, exist between Lhem. Since the crystal is at pH 5.8. this implies that the pKa of the N3 is greater 

20 than 6. Muth and Strobel have measured the pKa of the corresponding A in E. coli23S RNA by examining its dimethyl 
sulfate reactivity as a function of pH and have concluded that it is 7.6 ; though they cannot be sure from their experiments 
whether it is the N3 or N1 whose pKa they have measured (Muth et ai (2000) Science 289: 947-950). Because there 
is no other available, titratable RNA functional group closer than about 7 A to the nascent peptide bond, there is not 
other group available to function as a general base. 

25 [0199] There are several features of environment of A2486 (2451) that might affect its pKa. The pKa of the N3 of 
A2486 (2451) may be increased significantly in part by a charge relay mechanism, analogous to that which occurs in 
the active site of the serine proteases, with the buried phosphate of A2485 (2450) performing a similar function as the 
buried carboxylale of Asp 102 of chymotrypsin. The experimental 2 4 A electron density map unambiguously estab- 
lishes the hydrogen bonding interactions in this most critical region of the active site (see Figure 16(A)). The N6 of 

30 A2486 interacts with the 06 atoms of G2482 (2447) and G2102 (2061) (see Figure 16(B)). The N2 of G2482 is also 
interacting with a non-bridging oxygen of the phosphate group of A2485 (2450) that is among the total of 3 most solvent 
inaccessible phosphate groups (826. 1497 and 2485) in the large nbosomal subunit for which we do not see any 
neutralizing counterion in the 2.4 A resolution map. Weak density that may correspond to a water molecule is hydrogen 
bonded to the other non-bridging oxygen. A neutralizing counterion is not apparent in this structure. The buried phos- 

35 phate of A2485 could abstract the proton from the exocyclic N2 of G2482 in order to neutralize its energetically unfa- 
vorable buried negative charge. This, in turn, would stabilize the otherwise rare imino tautomer of that base. The 
interaction of an imino of G2482 with A2486 likewise can stabilize the imino tautomer of A2486 that would result in a 
negative charge on its N3 were it unprotonated (see Figure 16(C)). In this way. some of the negative electrostatic 
charge originating on the buried phosphate of A2485 could be relayed to the N3 of A2486. thereby increasing its pKa. 

40 [0200] A second feature of the environment of the catalytic site that may affect its stability, tautomeric state and 
electrostatic charge distribution is a bound monovalent cation. A potassium or a sodium ion interacts with the 06 of 
G2482 and G2102 as well as with three other bases. Its identity as a potassium ion is established by its observed 
continuation and by an independent experiment showing that a rubidium ion can bind to this site. The monovalent ion 
might also stabilize non-standard tautomers but its expected influence on the pKa of A2486 is less obvious. Early 

-ts biochemical experiments have shown the importance of potassium for peptidyl transferase activity (Monro (1 967) supra; 
Maden et ai. (1 968) supra) and this binding site could be responsible for that affect. 

[0201] II may also be the case that stabilization of an imino tautomer by a buried phosphate explains the expected 
higher pKa of a catalytic cytosine in the active site of the hepatitis delta virus ribozyme (Ferre-D'Amare ef a/. (1998) 
Nature 395 : 567-574; Naharo et at. (2000) Science 287: 1493-1497). in this case, a backbone phosphate, whose 
50 solvent accessibility is similar to that of A2485 in the ribosome. is observed to hydrogen bond to the N4 of C. and the 
protonated form of the imino tautomer of that C would neutralize the phosphate, promoting the function of its N3 as a 
general acid (Naharo ct ai (2000) supra). 

4. Catalytic Mechanism of Peptide Bond Formation. 

55 

[0202] The proximity of the N3 of A2486 (2451 ) to the peptide bond being synthesized and the nature of the reaction 
catalyzed suggest a chemical mechanism of peptide synthesis that is analogous to the reverse of the deacytation step 
seen in serine proteases during peptide hydrolysis (Blow et ai (1969) Nature 221: 337-340; Steitz et ai (1982) Ann. 
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Rev. Biophys. Bioeng. 11: 419-444). In that mechanism, the basic form of His57 abstracts a proton from the amino 
group of the peptide product as it attacks the acyl-Ser195. Formation of the tetrahedral carbonyl carbon intermediate 
is stabilized by interaction of the oxyanion formed with backbone amides (the "oxyanion hole"): His57 shuttles the 
proton acquired from the -NH 2 to Ser195 as the tetrahedral intermediate breaks down. 

5 [0203] The residue A2486 (2451 ) appears to be the analogous to His57 in chymotrypsin and that the peptidyl-tRNA 
is analogous to acyl-SeM 95. Thus, the N3 of A2486. with its greatly elevated pKa. abstracts a proton from the - amino 
group of the A-site bound aminoacyl-tRNA facilitating the nucleophilic attack of that amino group on the carbonyl carbon 
that acylates the 3' OH of the tRNA in the P-site (see Figure 17(A)). In contrast to the serine proteases, however, the 
oxyanion of the tetrahedral intermediate is near to the protonated N3 of A2486 (A2451 ) rather than being proximal to 

w a separate oxyanion binding site, Thus, it could be that the protonated N3 of A2486 stabilizes the formation of the 
oxyanion by hydrogen bonding to it. as we observe in the Yarus inhibitor complex (see Figure 1 7(B)). The N3 of A2486 
could then subsequently transfer its proton to the 3' hydroxyl of the P-site bound tRNA : which is liberated as the peptide 
shifts to the A-site bound tRNA (see Figure 17(C)). 

[0204] An additional question is how is the catalyzed hydrolysis of the peptidyl tRNA in the P-site prevented prior to 

is the delivery of the next appropriate aa-tRNA to the A-site? It appears from this complex that water would not be excluded 
from access to the peptidyl link to the P-site tRNA if the A-site were vacant. An analogous problem was discussed by 
Koshland in the 1960s (Koshland. Jr. (1963) Cold Spring Harbor Symp. Quant. Biol 28: 473-489). who theorized why 
hexokinase does not hydrolyze ATP in the absence of glucose, since water should bind perfectly well to the binding 
site used by the 6-hydroxyl of glucose. The answer proposed was induced fit. i.e., hexokinase is not catalytically com- 

20 petent until the glucose binds and produces a conformational change that orients substrates and catalytic groups 
optimally. This indeed appears to be the case (Bennett. Jr. et ai (1978) Proc. Natl. Acad. Set. USA 75: 4848-4852). 
Similarly, it could be that the catalytic A2486 and/or the peptidyl substrate are not properly oriented or that the binding 
site for the -NH 2 group is blocked by a reoriented ribosome base in the absence of aa-tRNA in the A-site. We do 
observe that the base of U2620 is close to A2486 in the ligand free structure, and it may serve as the appropriate plug 

25 that prevents spontaneous hydrolysis of peptidyl-tRNA. 

[0205] Thus, it appears that this RNA enzyme uses the same principles of catalysis as a protein enzyme. First, a 
large catalytic enhancement is achieved by precisely orienting the two reactants, the NH 2 from the A-site aminoacyl- 
tRNA and the carbonyl carbon from the P-site peptidyl-tRNA. This may be accomplished, in part by the interactions 
of the CCA ends of the A-site and P-site tRNAs with the A-loop and P-loop. respectively. Secondly acid-base catalysis 

30 and transition state stabilization are achieved by an enzyme functional group (A2486 (2451 ) in this case) whose chem- 
ical properties are altered appropriately by the active site environment. Third, similar chemical principles may be used 
by RNA and protein enzymes to alter the pKa's of functional groups. A buried carboxylate of Asp102 acting through 
His57 alters the nucieophiiicity of Ser195 in chymotrypsin (Blow et ai. (1969) supra). In the ribosome a solvent inac- 
cessible phosphate may act through G2482 (2447) alters the nucieophiiicity of the N3 of A2486 (2451 ) . It could be that 

35 RNA molecules "learned" how to use the chemical principles of catalysis significantly before protein molecules did. 

5. tRNA Binding. 

[0206] While it is not possible to bind tRNA molecules to either the A-or P-sites in these crystals for stenc reasons. 

^o it is possible to place the A-. P- and E-site tRNA molecules on the large ribosomal subunit in the same relative orientation 
that Cate et ai. observed in their crystallographic study of the Thermus aquaticus 70S ribosome. The co-ordinates of 
the three tRNA molecules in the relative positions seen in the 70S ribosome can be docked on the Haloarcula maris- 
mortui large ribosomal subunit in a way that avoids steric clashes and places the acceptor stems of the A-site and P- 
site tRNAs near to the positions of the CCAs we have observed bound to the A-loop and P-loop (see Figure 18). 

^5 Although nucleotides C74 and C75 were modeled in a different conformation in the 7.8 A ribosome map. the C74 
residues from the CCAs in both the A- and P-sites can be connected to residue 72 of the docked A-site and P-site 
tRNAs through a modeled residue 73. and it appears thai the IRNA molecules fit well onto the surface of the subunit. 
Unexpectedly, this modeling places the E-site. P-site and A-sile bound tRNA molecules in close proximity to three 
ribosomal proteins. Proteins L5 and L10e are near tRNAs in the P-site and A-site. Since both of these proteins also 

so interact with 5S rRNA. this observation raises the possibility that 5S rRNA and some of its associated proteins might 
help stabilize the positioning of ribosome bound tRNAs and is consistent with the fact that 5S rRNA enhances ribosomal 
activity, but is not absolutely essential for it (Moore. Ribosomal RNA & Structure. Evolution. Processing and Function 
in Protein Biosynthesis (1996). supra, pp. 199-236). Protein L44e appears to interact with the E-site tRNA and may 
contribute to E-site activity. According to this docking experiment the A-site tRNA interacts with the highly conserved 

55 stem-loop 2502-251 8 (2467-2483) which together with L10e forms a large concave surface that contacts the tRNA on 
the T-stem. utilizing the exact same binding site exploited by EF-Tu (Gutetl et ai (2000) supra) 
[0207] Examination of the relationships between the CCAs bound in the A- and P-sites and the tRNAs to which they 
are connected as well as their interactions with the ribosome also leads to some insights into translocation. Immediately 
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after formation of the new peptide bond and deacytation of the P-site tRNA, the acceptor end of the P-site tRNA is 
known to move to the E-side and that of the A-site tRNA moves to the P-site (Blobel ef at (1970) J. Cell. Biol. 45: 
130-145). The approximate modeling of the 3 tRNA molecules on the large subunit suggests some possible contribu- 
tions to this process. First, there are two base-pairs between the P-site tRNA and the P-loop and only one between 
5 the A-site and the A-loop. Moving from the A- to the P-site increases base-pairing, though there must be a concomitant 
attraction of the deacylated P-site tRNA to an E-site. Further, the CCAs bound to the A and P loops are related by 1 80° 
rotation, whereas the tRNAs to which they are attached are not. Thus, the relationships of the CCAs to the acceptor 
stems cannot be the same in both sites and may not be equally stable. If the conformation of the A-site tRNA is less 
stable, then moving a tRNA from the A- to the P-site might be energetically favored. 

10 

6. Polypeptide Exit Tunnel. 

[0208] It appears very likely from the structure that all nascent polypeptides pass through the polypeptide exit tunnel 
before emerging from the ribosome. because there appears to be no other way out. We are now able to address two 
15 important questions about the functioning of the polypeptide exit tunnel: (1) Why do nascent proteins not stick to its 
walls? Teflon has the marvelous property of not sticking to denatured egg proteins, so how has the ribosome achieved 
a similar non-stick surface for the denatured proteins that must pass through the tunnel? (2) Do proteins fold to any 
degree in the tunnel giving the ribosome a chaperon-like function? 

[0209] The length of the tunnel from the sile of peptide synthesis to its exit is about 100 A, broadly consistent wilh 
20 the length of nascent polypeptide that is protected from proteolytic cleavage by the ribosome (Moazed et at (1989) 
Nature 342.1 42) and the minimum length required for antibody recognition at the exit (Picking etat (1992) Biochemistry 
31 : 2368-2375). The tunnel is largely straight, except for a bend 20 to 35 A from the peptidyl transferase center (see 
Figure 1 9). Its diameter varies from about 20 A at its widest to a narrow point of about 1 0 A at the very beginning and 
at a position 28 A from the tunnel exit with an average diameter of about 15 A. Since the smallest orifice through which 
25 the polypeptide product must pass only barely accommodates the diameter of an -hel i x diameter, it seems unlikely 
that significant protein folding beyond the formation of -helix could occur within the ribosome. 

[0210] The majority of the tunnel surface is formed by domains I - V of 23S rRNA, but significant contributions are 
also made by the non-globular regions of proteins L22, L4 and L39 which not only fill some of the voids in the RNA 
scaffold, but also form significant portions of the tunnel wall (see Figure 19). The largest protein contributor to the 

30 surface of the tunnel is L22 whose long - hairpin loop lies between RNA segments of domains I through IV and is 
approximately parallel with the axis of the tunnel. Unlike the other tunnel proteins, protein L39 does not have a globular 
domain at the surface of the particle and is almost entirely buried in domains I and III underneath protein L23. Inter- 
estingly, the nucleotides of 23S rRNA that form the tunnel wall are predominantly from loops in the 23S rRNA secondary 
structure (see Figure 1 9). As it progresses through the tunnel from the active site, a nascent polypeptide first encounters 

35 domain V followed 20 A further along by domains II and IV and proteins L4 and L22. The last half of the tunnel is formed 
by domains I and III and the protein L39e. 

[0211] The narrowest part of the tunnel is formed by proteins L22 and L4 which approach the tunnel from opposite 
sides forming what appears to be a gated opening (see Figure 19C). The function of this constriction, if any. is not 
obvious. It might be the place where the nature of the nascent chain is sensed and the information transmitted to the 
JO surface of the particle, perhaps through L22 or L4. The -ha i rpin of L22 at the site of this orifice and the 23S rRNA 
interacting with it are highly conserved; its globular portion is located adjacent to the tunnel exit on the surface that 
must face the translocon during protein secretion (see Figure 19). 

[0212] The "non-stick" character of the tunnel wall must reflect a lack of structural and polarity complementarity to 
any protein sequence or conformation that it encounters. The tunnel surface is largely hydrophilic and includes exposed 
hydrogen bonding groups from bases, backbone phosphates and polar protein side-chains (see Figure 19). While 
there are many hydrophobic groups (sugars, bases, protein side-chains) facing the tunnel as well, there are no patches 
of hydrophobic surface large enough to form a significant binding site for hydrophobic sequences in the nascent 
polypeptide. As the tunnel is some 20 A in diameter and filled with water and the newly synthesized polypeptide is 
presumably freely mobile, the binding of a peptide to the tunnel wall would result in a large loss of entropy that would 

50 have to be compensated for by a large complementary interaction surface that is larger than 700 A (Chothia ef at 
(1975) Nature25S. 705-708). Similarly, while Arg and Lys side-chains from a nascent peptide may indeed interact with 
the phosphates exposed in the tunnel, the degree of structural complementarity and the net binding energy obtained 
after displacing bound counterions must be too small to overcome the large unfavorable entropy of immobilization that 
would result from peptide binding. Thus, although the ribosome tunnel is made primarily of RNA. the nature of its 

55 surface is reminiscent of the interior surface of the chaperonin, GroEL (Xu ef at (1998) J. Struct Biol. 124:129-141) 
in its non-binding conformation. Only in the conformation that exposes a large hydrophobic surface does GroEL bind 
a denatured protein. 

[0213] There are six proteins (L19, L22, L23. L24. L29 and L31e) located at the exit from the tunnel, facing the 
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translocon onto which the ribosome docks during protein secretion. There is evidence that the ribosome binds the 
translocon even after extensive digestion of its protein by protease implying that interaction between the translocon 
and the ribosome is mediated by RNA. The proximity of these proteins to the translocon. however, leads us to wonder 
what role, if any. they might play in the protein secretion process. Recent data from the Dobberstein laboratory shows 
5 that the N-terminal domain of SRP54. the G-protein from the signal recognition particle involved in signal peptide 
binding, can be crosslinked to ribosomal proteins L23 and L29. These two proteins are adjacent to each other and at 
the tunnel exit (see Figure 19). 

7. Evolution. 

10 

[0214] In vitro evolution of RNA oligonucleotides has produced small RNA molecules that can bind molecules like 
the Yarus inhibitor effectively or catalyze the peptidyl transfer reaction (Zhaug et ai (1998) Chem. Biol. 5: 539-553; 
Welch et al. (1997) supra). The sequence and secondary structure of one of these selected RNAs is reminiscent of 
the peptidyl transferase loop in domain V of 23S rRNA (Zhaug et ai. (1998) supra). The most striking similarity is a five 

is nucleotide sequence that is identical to a sequence in domain V that includes the catalytic A2486. G2482 and the 
buried phosphate of A2485. Remarkably all of the groups involved in the proposed charge relay system for activating 
A2486 in the ribosome, are present in the in vitro selected ribozyme. Thus, though the surrounding structural context 
is likely to be different, it seems plausible that this artificially evolved ribozyme uses the same mechanisms as the 
ribosome for shifting the pKa or an adenine and likewise uses it as a base for peptide synthesis. A second RNA (Welch 

20 et al. (1 997) supra) contains a 12 nucleotide loop that includes a 9-base sequence identical to that found in the same 
region of the peptidyl transferase loop. 

[0215] The striking similarities between the sequences containing the key catalytic elements found in the peptidyl- 
transferase active site of the ribosome and sequences of in vitro selected RNAs having related activities make it clear 
that the appearance of a small RNA domain capable of catalyzing peptidyl transferase was a plausible first step in the 

25 evolution of protein synthesis on the ribosome. The first peptides synthesized by this primordial peptide synthesizing 
enzyme might have been random polymers or copolymers, and it may have functioned with substrates as simple as 
an aminoacylated CCA. Basic peptides of the types observed to form the non-globular extensions that co-fold with the 
23S rRNA might have been among the first peptides synthesized that were functionally useful. Such peptides may 
have enhanced the stability of the protoribosome and other early ribozymes as the more sophisticated peptides of the 

30 present day ribosome appear to do. 

C. Atomic Structure of the Large Ribosomal Subunit at 2.4 A Resolution, Complete Refinement 

[0216] The three-dimensional structure of the large ribosomal subunit from Haloarcula mahsmortui has now been 

35 completely refined at 2.4A resolution. The model includes 2876 RNA nucleotides. 3701 amino acids from 28 ribosomal 
proteins. 117 magnesium ions. 88 monovalent cations, and 7898 water molecules. Many of its proteins consist of a 
surface-exposed globular domain and one or more basic, non-globular extensions that are buried in rRNA. Half of them 
include motifs common in non-ribosomal proteins including, for example: RRM domains. SH3-like barrels and zinc 
fingers. Proteins that have significant sequence and structural similarity, such as L15 and L18e : make essentially 

-to identical interactions with rRNA. 

[0217] More particularly, the H. mahsmortui 50S subunit has been completely rebuilt and refined by successive 
rounds of gradient energy minimization and B-factor refinement using CNS (Brunger etal. (1998) supra). Ribosomal 
proteins and rRNA were completely rebuilt using the software program "O" (Jones ; T.A. et al. (1991) Acta Crystallogr. 
A46: 110-119) with 2F G -Fc electron density maps prior to the modeling of solvent and metal ions. Modeling errors in 

45 the proteins were identified using PROCHECK (Laskowski et al. (1 993) J. Appl. Crysl 26: 283-291 ) and by inspection 
of F -Fc maps. Difference maps also aided in the identification of errors in the rRNA, most often associated with sugar 
puckers In the process, some adjustments were made in amino acid conformations, sequence register and in se- 
quences themselves. Sequence changes made were largely limited to L10e. L15e. and L37Ae, the only three proteins 
from the H. mahsmortui SOS that have not been sequenced directly. In addition, fifty-one amino acids were added to 

so the model described in section HA with forty-four of these coming from L10 at the base of the L7/L12 stalk and L39e 
which lines a portion of the wall of the polypeptide exit tunnel. Fewer adjustments were made to the rRNA structure. 
Forty-nine new nucleotides were modeled and refined, mainly in helices 43 and 44 in domain II of 23S rRNA. In addition, 
the sugar pucker or conformation about the glycoside bond was adjusted for some nucleotides. The refinement process 
was monitored by the quality of electron density maps calculated using phases derived from the model as well as R / 

55 R (ree values. The completely refined model now includes 2876 RNA nucleotides. 3701 amino acids ; 210 metal ions, 
and C 7898 water molecules The model refines to an R/R, ree of 18.9% / 22.3% and has excellent geometry (Table 4). 
[0218] Solvent modeling began with the generation of a list of possible magnesium ions obtained by an automatic 
peak selection using CNS. Peaks greater than 3.5 in F D -Fc maps positioned within the magnesium inner-sphere co- 
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ordination distance of 1 9 - 2 . 1 A from N or O atoms were selected. The resulting list was manually inspected and only 
peaks that displayed clear octahedral coordination geometry were selected as magnesium. 

[0219] Monovalent cations were identified on the basis of isomorphous differences between rubidium-soaked and 
native crystals of the H. marismortui SOS subunit. Since native crystals were stabilized in the presence of 1 .6M NaCI. 

5 these sites were initially modeled and refined as Na* 1 . Refinement of these sites as K +1 almost always resulted in 
unusually high temperature factors, with two exceptions where we have modeled K +1 . Most of the monovalent sites in 
the SOS subunit appear to occupied by Na 41 in our crystals, however, these sites are likely to be occupied by K +1 in vivo. 
[0220] Waters were selected as peaks greater than 3.5 in F Q -F C electron density maps and between 2.5 and 3.3 A 
of O or N atoms. Individual B-factor values were used to assess the assignment of water molecules. A number of 

10 waters refined to B-factors significantly lower than surrounding RNA and protein atoms. In many cases these peaks 
were found to be metal or chloride ions. A small number of low B-factor water molecules were retained in the final 
model because they could not be unambiguously assigned as other species. As a result of adding metal ions and water 
molecules the final model now contains 98. 547 non-hydrogen atoms. The refinement and model statistics for the large 
ribosomal subunit are summarized in Table 4. 

75 



Table 4 





Refinement and Model Statistics for the H. marismortui SOS Subunit 




Space Group 


C222 n 


20 


a = 21 1 .66A, b = 299.67A, c = 573.77A 






Total non-hydrogen atoms 


9B : 542 




RNA atoms 


61,617 




Protein atoms 


28 ; 800 




Water molecules 


7.893 


25 


Magnesium ions 


117 




Potassium ions 


2 




Sodium ions 


86 




Chloride ions 


22 


30 


Cadmium ions 


5 




Refinement Statistics: 




Resolution Range 


15.0-2.4A 




Number of reflections used in refinement 


623,525 


35 


Number of reflections for cross-validation 


6.187 




^ working 


18.9% 




R free 


22.3% 




o a coordinate error (cross-validated) 


0.35A (0.43A) 




luzzati coordinate error (cross-validated) 


0.29A (0.35A) 


40 


Deviations from ideality: 






r.m.s.d. bond lengths 


0.0052A 




r.m.s.d. bond angles 


1.13° 




r m.s.d. dihedrals 


15.7° 


45 


r.m.s.d. impropers 


2.12° 




Protein Statistics from Ramachandran Plot: 




Residues in most favored regions 


2704 (86.6%) 




Residues in additional allowed regions 


379 (12.1%) 


50 


Residues in generously allowed regions 


27 (0.9%) 




Residues in disallowed regions 


13 (0.4%) 




Average B-factor Statistics (A 2 ): 




All atoms (high / low) 


44.3(10.1/133.7) 


55 


rRNA 


41.2(11.78/125.0) 




proteins 


49.7(13.9/92.5) 




waters 


41.89(9.58/115.4) 
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[0221] Refinement has also permitted additional modeling of L10. L39e : and the L11 binding site in 23S rRNA. Fur- 
thermore, it has been discovered that certain motifs, for example. RRM topologies. SH3-like barrels and zinc fingers 
are common in the 50S proteins and each recognizes rRNA in many different ways. Proteins that have significant three- 
dimensional homology, however, such as L1 5 and L1 8e as well as L1 8 and S1 1 . make essentially identical interactions 

5 with rRNA. Additional structural homologies between SOS proteins and non-ribosoma! proteins also are apparent. The 
solvent exposed surfaces of these globular protein domains are rich in aspartate and glutamate residues, while irregular 
protein extensions penetrate the RNA core of the ribosome. These extensions are often highly conserved, and their 
abundance of arginine. lysine, and glycine residues is important for their function Collectively, the results show evo- 
lutionary connections between many ribosomal proteins and illustrate that protein-RNA interactions in the ribosome. 

10 although largely idiosyncratic, share some common principles. 

D. Antibiotic Binding Sites 

[0222] In addition to the foregoing structural studies, the structure of the large ribosomal subunit of H. marismortui 
is has been determined complexed with each of seven different antibiotics. More specifically, crystals of the H. marismortui 
large ribosomal subunit have been soaked with one of the following antibiotics: anisomycin. blasticidin. carbomycin. 
tylosin. sparsomycin. virginiamycin or spiramycin. The structure of the large ribosome subunit complexed with each 
antibiotic was then resolved based on X-ray diffraction data generated for each crystal. 

[0223] Briefly, a small amount of a concentraled antibiotic solulion was added to a large subunit crystal suspended 
20 in stabilization solution and incubated for several hours. Following freezing and the other procedures normally used 
to prepare such crystals for experimental use. X-ray diffraction data were collected from the antibiotic containing crys- 
tals. Because the crystals were isomorphous to those from which the structure described above was derived, the 
phases obtained for native crystal were combined with the diffraction intensities obtained from the antibiotic-soaked 
crystal to obtain a structure for the latter. The position of the antibiotic in the crystal to which was bound is revealed 
25 most clearly in difference electron density maps, which are electron density maps computed using the phases just 
referred to and amplitudes obtained by subtracting the amplitudes of crystals that contain no antibiotic from the (suitably 
scaled) amplitudes of those that contain antibiotic. By using the foregoing methods, it was possible to determine the 
atomic co-ordinates that show the spatial relationship between particular antibiotics and their binding sites within the 
large ribosomal subunit It is contemplated that similar methods can be used to resolve the structure of other antibiotics 
30 complexed to the large ribosomal subunit. 

[0224] The atomic co-ordinates of the large ribosomal subunit complexed with antsomycin are listed in a table on 
compact disk Disk No. 3 of 3 under the file name anisomycin. pdb. In addition. Figure 20 shows the spatial relationship 
between the antibiotic anisomycin and the large ribosomal subunit. 

[0225] The atomic co-ordinates of the large ribosomal subunit complexed with blasticidin are listed in a table on 
35 compact Disk No. 3 of 3 under the file name blasticidin. pdb. Figure 21 shows the spatial relationship between the 
antibiotic blasticidin and the large ribosomal subunit. For orientation. Figure 21 also includes a substrate for the P-site. 
[0226] The atomic co-ordinates of the large ribosomal subunit complexed with carbomycin are listed in a table on 
compact disk Disk No. 3 of 3 under the file name carbomycin. pdb Figure 22 shows the spatial relationship between 
the antibiotic carbomycin and the large ribosomal subunit. Figure 22 also shows a portion of the polypeptide exit tunnel. 
40 [0227] The atomic co-ordinates of the large ribosomal subunit complexed with tylosin are listed in a table on compact 
disk Disk No. 3 of 3 under the file name tylosin. pdb. Figure 22 shows the spatial relationship between the antibiotic 
tylosin and the large ribosomal subunit. Figure 22 also shows a portion of the polypeptide exit tunnel. 
[0228] The atomic co-ordinates of the large ribosomal subunit complexed with sparsomycin are listed in a table on 
compact disk Disk No. 3 of 3 under the file name sparsomycin. pdb. Figure 23 shows the spatial relationship between 
•is the antibiotic sparsomycin and the large ribosomal subunit. For orientation. Figure 23 also shows a substrate for the 
P-site. 

[0229] The atomic co-ordinates of the large ribosomal subunil complexed with virginiamycin are listed in a table on 
compact disk Disk No. 3 of 3 under the file name virginiamycin. pdb. Figure 24 shows the spatial relationship between 
the antibiotics virginiamycin as well carbomycin, and the large ribosomal subunit. 
so [0230] The atomic co-ordinates of the large ribosomal subunit complexed with spiramycin are listed in a table on 
compact disk Disk No. 3 of 3 under the file name spiramycin. pdb. 

[0231] Figure 25 shows the spatial orientations of several antibiotics, namely, blasticidin, anisomycin. virginiamycin 
and carbomycin. as they bind to their respective antibiotic binding sites within the large ribosomal subunit. For purposes 
of orienting the reader, the positions of the P-site. A-site and the polypeptide exit tunnel are shown in Figure 25. As is 
55 apparent, these antibiotics bind to or contact specific locations within the large ribosomal subunit to disrupt protein 
biosynthesis. For example, it appears that blasticidin binds the large ribosomal subunit in the vicinity of the P-site; 
anisomycin and virginiamycin bind the large ribosomal subunit in the vicinity of the A-site: and carbomycin (a macrolide) 
binds the large ribosomal subunit in the vicinity of the polypeptide exit tunnel adjacent the peptidyl transferase stte. 
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[0232] From Figure 25 . it is apparent that the skilled artisan may identify certain portions of each antibiotic that contact 
regions in the large ribosomal subunit. By knowing their spatial relationship with respect one another, the skilled artisan 
may generate a hybrid antibiotic molecule comprising a portion of a first template antibiotic and a portion of a second, 
different template antibiotic. The two portions may be linked by a chemical linker so as to maintain the spatial orientation 
5 of one portion with respect to the other portion. As a result, the hybrid antibiotic may simultaneously bind each of the 
regions of the ribosomal subunit typically bound by each template antibiotic. The design and testing of such molecules 
is discussed in more detail below. 

E. Experimental Techniques Which Exploit X-Ray Diffraction Data 

10 

[0233] Based on the X-ray diffraction pattern obtained from the assemblage of the molecules or atoms in a crystalline 
solid, the electron density of that solid may be reconstructed using tools well known to those skilled in the art of crys- 
tallography and X-ray diffraction techniques. Additional phase information extracted either from the diffraction data and 
available in the published literature and/or from supplementing experiments may then used to complete the recon- 
15 struction. 

[0234] For basic concepts and procedures of collecting, analyzing., and utilizing X-ray diffraction data for the con- 
struction of electron densities see. for example. Campbell et at. (1984) Biological Spectroscopy , The Benjamin/Cum- 
mings Publishing Co.. Inc.. (Menlo Park, CA); Cantor et at. (1980) Biophysical Chemistry. Part II: Techniques for the 
study of biological structure and function , W.H. Freeman and Co.. San Francisco, CA: A.T Brunger (1993) X-PLOR 

20 Version 3.1 : A system for X-ray crystallography and NMR. Yale Univ. Pr. ; (New Haven, CT): M.M. Woolfson (1 997) An 
Introduction to X-ray Crystallography , Cambridge Univ. Pr., (Cambridge. UK); J. Drenth (1999) Principles of Protein X- 
ray Crystallography (Springer Advanced Texts in Chemistry), Springer Verlag; Berlin; Tsirelson et ai (1996) Electron 
Density and Bonding in Crystals: Principles Theory and X-ray Diffraction Experiments in Solid State Physics and 
Chemistry . Inst, of Physics Pub,; U.S. Patent No. 5.942.428; U.S. Patent No. 6.037.117; U.S. Patent No. 5,200,910 

25 and U.S. Patent No. 5.365.456 ("Method for Modeling the Electron Density of a Crystal"). 

[0235] A molecular model may then be progressively built using the experimental electron density information and 
further refined against the X-ray diffraction data resulting in an accurate molecular structure of the solid. 

F. Structural Determination Of Other Large Ribosomal Subunits 

30 

[0236] It is understood that the skilled artisan, when provided with the atomic co-ordinates of a first macromolecule 
may use this information to quickly and easily determine the three-dimensional structure of a different but structurally 
related macromolecule. For example, the atomic co-ordinates defining the large ribosomal subunit from H. marismortui 
can be used to determine the structure of the large ribosomal subunit from other species either as an isolated subunit. 

35 in complex with the small subunit. or either of these complexed with functionally important ligands, for example: ami- 
noacyl tRNA; various protein synthesis factors, such as elongation factor G, elongation factor Tu, termination factor or 
recycling factor, in both their GTP and GDP conformational states; and protein synthesis inhibitors, for example, anti- 
biotics. In addition, the H. marismortui subunit co-ordinates can also be used to solve the structures of ribosomal 
complexes with components of the protein secretion machinery, for example, the signal recognition particle ; and the 

-to translocon. 

[0237] If the crystal being examined contains a macromolecule of unknown structure and no additional information 
is available, additional experiments sometimes may be required to determine the relevant phases of the macromolecule. 
These studies can often be time consuming and uncertain of success (Blundell et ai. (1976) supra). However, when 
additional information, for example, structural and/or crystal log rap hie information, is available for molecules related in 
^5 some way to the macromolecule of interest then the process of resolving the structure of the molecule of interest is a 
much less challenging and time-consuming task. 

[0238] Accordingly, the skilled artisan may use information gleaned from the prior resolved slruclure to develop a 
three-dimensional model of a new molecule of interest. Furthermore, the skilled artisan may use a variety of approaches 
to elucidate the three-dimensional structure of the new molecule. The approaches may depend on whether crystals of 
so the molecule of interest are available and/or whether the molecule of interest has a homologue whose structure has 
already been determined. 

[0239] In one approach, if the molecule of interest forms crystals that arc isomorphous, i.e., that have the same unit 
cell dimensions and space group as a related molecule whose structure has been determined, then the phases and/ 
or co-ordinates for the related molecule can be combined directly with newly observed amplitudes to obtain electron 
55 density maps and. consequently, atomic co-ordinates of the molecule of interest. The resulting maps and/or atomic 
co-ordinates may then be refined using standard refinement techniques known in the art. In another approach, if the 
molecule of interest is related to another molecule of known three-dimensional structure, but crystalli7es in a different 
unit cell with different symmetry, the skilled artisan may use a technique known as molecular replacement to obtain 
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useful phases from the co-ordinates of the molecule whose structure is known (Blundell et at. (1976) supra). Tnese 
phases can then be used to generate an electron density map and/or atomic co-ordinates for the molecule of interest. 
In another approach, if no crystals are available for the molecule of interest but it is homologous to another molecule 
whose three-dimensional structure is known, the skilled artisan may use a process known as homology modeling to 

5 produce a three-dimensional model of the molecule of interest. It is contemplated that other approaches may be useful 
in deriving a three-dimensional mode! of a molecule of interest. Accordingly, information concerning the crystals and/ 
or atomic co-ordinates of one molecule can greatly facilitate the determination of the structures of related molecules. 
[0240] The method of molecular replacement, developed initially by Rossmann and Blow in the 1960s, is now used 
routinely to establish the crystal structures of macromolecules of unknown structures using the structure of a homol- 

10 ogous molecule, or one in a different state of ligation (M.G. Rossmann. ed. "The Molecular Replacement Methods," 
Int. Sci. Rev. J. No. 13. Gordon & Breach. New York. NY (1972): Eaton Lattman. "Use of Rotation and Translation 
Functions/' H.W. Wyckoff, C.H. W. Hist. (S.N. Timasheff. ed.) Methods in Enzymology , 115: 55-77 (1985)). For an 
example of the application of molecular replacement, see, for example, Rice. P.A. & Steitz ; T.A.(1994) EMBO J. 13: 
1514-24. 

15 [0241] In molecular replacement, the three-dimensional structure of the known molecule is positioned within the unit 
cell of the new crystal by finding the orientation and position that provides the best agreement between observed 
diffraction amplitudes and those calculated from the co-ordinates of the positioned subunit. From this modeling, ap- 
proximate phases for the unknown crystal can be derived. In order to position a known structure in the unit cells of an 
unknown, but related structure, three rotation angles and three translations relative to the unit cell origin have to be 

20 determined. The rotation search is carried out by looking for agreement between the Patterson function of the search 
and target structures as a function of their relative orientation (the rotation function). X-PLOR (Brunger et a/. (1987) 
Science 235:458-460: CNS (Crystallography & NMR System. Brunger era/.. (1998) Acta Cryst. Sect 054: 905-921), 
and AMORE: an Automatic Package for Molecular Replacement (Navaza : J. (1994) Acta Cryst. Sect. A. 50; 1 57-1 63) 
arc computer programs that can execute rotation and translation function searches. Once the orientation of a test 

25 molecule is known, the position of the molecule must be found using a translational search. Once the known structure 
has been positioned in the unit cell of the unknown molecules, phases for the observed diffraction data can be calculated 
from the atomic co-ordinates of the structurally related atoms of the known molecules. By using the calculated phases 
and X-ray diffraction data for the unknown molecule, the skilled artisan can generate an electron density map and/or 
atomic co-ordinates of the molecule of interest. 

30 [0242] By way of example, it is contemplated that a three-dimensional model of a ribosomal subunit other than that 
derived from H. marismortui can be generated via molecular replacement. In this method, the H. mahsmortui subunit 
structures are positioned within the unii cell of the new crystal by finding the orientation and position that provides the 
best agreement between observed diffraction amplitudes and those calculated from the co-ordinates of the positioned 
subunit. A starting electron density map calculated using 2F hk( (observed) - F hk) (calculated), where F(observed) are the 

35 diffraction amplitudes that have been measured from crystals of the unknown structure, and F(calculated) are the 
diffraction amplitudes calculated from the positioned H. marismortui subunit structure. Refinement of the initial model 
can be done as is standard in the field of macromolecular crystallography. 

[0243] The H. marismortui SOS structure can also be used to establish the structure of a 70S ribosome or 50S 
ribosome for which an election density map has been calculated, at a resolution that would otherwise be too low to be 

•*o interpreted, while a 5 resolution map could not be interpreted in atomic terms de novo, a plausible atomic model can 
be constructed by refitting the H. marismortui SOS structure to a lower resolution map (e.g.. 4.5 to 8 ). This refitting can 
be combined with homology modeling to obtain a three-dimensional model of a ribosome or ribosomal subunit from a 
different species. It is contemplated that similar procedures may be used to determine the structure of the eukaryotic 
60S subunit and/or a eukaryotic ribosome. 

^5 [0244] In general, the success of molecular replacement for solving structures depends on the fraction of the struc- 
tures that are related and their degree of identity. For example, if about 50% or more of the structure shows an r.m.s. 
difference between corresponding atoms in the range of about 2 or less, the known structure can be successfully used 
to solve the unknown structure. 

[0245] Homology modeling, also known as comparative modeling or knowledge-based modeling, can be used to 
so generate a three-dimensional model for a molecule based on the known structure of homologues. In general, the 
procedure may comprise one or more of the following steps: aligning the amino acid or nucleic acid sequence of an 
unknown molecule against the amino acid or nucleic acid sequence of a molecule whose structure has previously been 
determined: identifying structurally conserved and structurally variable regions: generating atomic co-ordinates for core 
(structurally conserved) residues of the unknown structure from those of the known structure(s); generating conforma- 
55 tions for the other (structurally variable) residues in the unknown structure: building side chain conformations: and 
refining and/or evaluating the unknown structure. 

[0246] By way of example, since the nucleotide sequences of all known SOS subunit rRNAs can be aligned relative 
to each other and to H. marismortui 23S and 5S rRNAs, it is possible to construct models of the structures of other 
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SOS ribosomal rRNAs. particularly in the regions of the tunnel and active sites, using the H. marismortui structure 
Likewise, homologous proteins can also be modeled using similar methodologies. Methods useful for comparative 
RNA sequence analysis are known in the art and include visual methods and number pattern methods, as well as 
methods employing chi-square statistics, phylogenetic algorithms, or empirical algorithms. Descriptions of some of the 

5 foregoing methods are available, for example, at http://www.rna.icmb.utexas.edu/ ; Gutell (1996). "Comparative Se- 
quence Analysis and the Structure of 16S and 23S rRNA." Ribosomal RNA, Structure. Evolution. Processing, and 
Function in Protein Biosynthesis , (Dahlberg A. and Zimmerman B., eds.) CRC Press. Boca Raton, pp. 1 11 -128; Guttell 
etai. (1993) Nucl. Acid Res. 21 : 3055 - 3074; Schnare etal. (1996) J. Mol. Biol. 256: 701-719. Particularly useful visual 
inspection methods include comparison of a particular position in a H. marismortui secondary structure diagram with 

10 the residues located at the analogous position on an E. coli secondary structure diagram. A software program that is 
particularly useful in homology modeling includes XALIGN (Wishart, D. era/.. (1994) Cabios 10: 687-88). See also, U. 
S. Patent No. 5 ; 8B4,230. 

[0247] To model the rRNA of a new species, bases of the H marismortui rRNA can be replaced, using a computer 
graphics program such as "O" (Jones et a/., (1991) Acta Cryst. Sect. A. 47: 110-119), by those of the homologous 

15 rRNA, where they differ. In many if not most cases the same orientation of the base will be appropriate. Insertions and 
deletions may be more difficult and speculative, but the rRNA forming the peptidyl transferase site and the portion of 
the tunnel closest to it are very highly conserved with essentially no insertions and deletions. Automated web-based 
homology modeling can be performed using, for example, the computer programs SWISS-MODEL available through 
Glaxo Wellcome Experimental Research in Geneva. Switzerland, and WHATIF available on EMBL servers. 

20 [0248] For other descriptions of homology modeling, see. for example, Gutell R.R. (1996). supra; Gutell R.R.. etai. 
(1993) Nucleic Acids Res. 21: 3055-3074; Schnare etai (1996) J. Mol. Biol., 256: 701-719; Blundell era/. (1987) Nature 
326: 347-352; Fetrow and Bryant (1993) Bio/Technology 11:479-484; Greer (1991) Methods in Enzymology 202: 
239-252: and Johnson etal. (1994) Crit. Rev. Biochem. Mol. Biol. 29:1 -68. An example of homology modeling can be 
found, for example, in Szklarz G.D (1997) Life Sci. 61 : 2507-2520. 

25 [0249] As discussed earlier, the large ribosomal subunit from prokaryotes and eukaryotes are structurally conserved 
The amino acid sequences of the large ribosomal subunit from prokaryotes and eukaryotes can be aligned due to the 
evolutionary conservation of the identity of amino acid residues that are important for three-dimensional structure, the 
nature and shape of the binding sites for substrates and the catalytic site. This similarity in amino acid sequence of 
the homologous large ribosomal subunit allows the construction of models, via homology modeling, for the molecules 

30 whose crystal structures have not been solved. 

[0250] The new ribosome or large ribosomal subunit structures determined using the H. marismortui crystals and/ 
or atomic co-ordinates can then be used for structure-based drug design using one or more of the approaches described 
hereinbelow. This information can then be used to design molecules that selectively bind and disrupt protein synthesis 
in the ribosomes of the pathogens while leaving the ribosomes of a host relatively unaffected. 

35 

G. Rational Drug Design 
f. Introduction 

•to [0251 ] It is contemplated that the atomic co-ordinates defining a large ribosomal subunit of interest whether derived 
from one or more of X-ray crystallography, molecular modeling, homology modeling or molecular replacement, may 
be used in rational drug design (RDD) to design a novel molecule of interest, for example, novel modulators (for ex- 
ample, inducers, mimetics or inhibitors) of ribosome function. Furthermore, it is contemplated that, by using the prin- 
ciples disclosed herein, the skilled artisan can design, make. test, refine and use hovel protein synthesis inhibitors 

J 5 specifically engineered to reduce disrupt, orotherwise or inhibit ribosomal function in an organism or species of interest. 
For example, by using the principles discussed herein, the skilled artisan can engineer new molecules that specifically 
target and inhibit ribosomal function in a pathogen for example, a particular prokaryolic. organism, while preserving 
ribosomal function in a host, for example, a eukaryotic organism . specifically a mammal, and more specifically, a human. 
As a result, the atomic co-ordinates provided and discussed herein permit the skilled artisan to design new antibiotics 

so that can kill certain pathogenic organisms while having little or no toxicity in the intended recipient, for example, a human. 
[0252] It is contemplated that RDD using atomic co-ordinates of the large ribosomal subunit can be facilitated most 
readily via computer-assisted drug design (CADD) using conventional computer hardware and software known and 
used in the art. The candidate molecules may be designed de novo or may be designed as a modified version of an 
already existing molecule, for example, a pre-existing antibiotic, using conventional methodologies. Once designed, 

55 candidate molecules can be synthesized using standard methodologies known and used in the art. Following synthesis, 
the candidate molecules can be screened for bioactivity. for example, by their ability to reduce or inhibit ribosome 
function their ability to interact with or bind a ribosome or a ribosomal subunit Based in part upon these results, the 
candidate molecules may be refined iteratively using one or more of the foregoing steps to produce a more desirable 
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molecule with a desired biological activity. The resulting molecules can be useful in treating, inhibiting or preventing 
the biological activities of target organisms, thereby killing the organism or impeding its growth. Alternatively, the re- 
sulting molecules can be useful for treating, inhibiting or preventing microbial infections in any organism, particularly 
animals, more particularly humans. 

5 [0253] In summary, the tools and methodologies provided by the present invention may be used to identify and/or 
design molecules of interest which bind and/or interact in desirable ways with ribosomes and ribosomal subunits. 
Basically, the procedures utilize an iterative process whereby the molecules are synthesized, tested and characterized. 
New molecules can be designed based on the information gained in the testing and characterization of the initial 
molecules and then such newly identified molecules can themselves be tested and characterized. This series of proc- 

10 esses may be repeated as many times as necessary to obtain molecules with desirable binding properties and/or 
biological activities. Methods for identifying candidate molecules are discussed in more detail below 

2. Identification of Candidate Molecules 

15 [0254] It is contemplated that the design of candidate molecules of interest can be facilitated by conventional ball 
and stick-type modeling procedures. However, in view of the size and complexity ot the large ribosomal subunit. it is 
contemplated that the ability to design candidate molecules may be enhanced significantly using computer-based 
modeling and design protocols. 

20 a. Molecular Modeling. 

[0255] It is contemplated that the design of candidate molecules, as discussed in detail hereinbelow, can be facilitated 
using conventional computers or workstations, available commercially from, for example, Silicon Graphics Inc. and 
Sun Microsystems, running, for example. UNIX based. Windows NT on IBM OS/2 operating systems, and capable of 

25 running conventional computer programs for molecular modeling and rational drug design. 

[0256] It is understood that any computer system having the overall characteristics set forth in Figure 27 may be 
useful in the practice of the invention. More specifically, Figure 27. is a schematic representation of a typical computer 
work station having in electrical communication (100) with one another via. for example, an internal bus or external 
network a central processing unit (101). a random access memory (RAM) (102), a read only memory (ROM) (103), a 

30 monitor or terminal (104). and optimally an external storage device, for example, a diskette. CD ROM. or magnetic 
tape (105). 

[0257] The computer-based systems of the invention preferably comprise a data storage means having stored therein 
a ribosome or ribosomal subunit or fragment sequence and/or atomic co-ordinate/X-ray diffraction data of the present 
invention and the necessary hardware means and software means for supporting and implementing an analysis means, 

35 As used herein, "a computer system" or "a computer-based system" refers to the hardware means, software means, 
and data storage means used to analyze the sequence, X-ray diffraction data, and/or atomic co-ordinates of the in- 
vention. As used herein the term "data storage means" is understood to refer to any memory which can store sequence 
data, atomic co-ordinates, and/or X-ray diffraction data, or a memory access means which can access manufactures 
having recorded thereon the atomic co-ordinates of the present invention. 

40 [0258] In one embodiment, a ribosome or ribosomal subunit. or at least a subdomain thereof, amino acid and nucleic 
acid sequence. X-ray diffraction data and/or atomic co-ordinates of the present invention are recorded on computer 
readable medium. As used herein, the term "computer readable medium" is understood to mean any medium which 
can be read and accessed directly by a computer. Such media include, but are not limited to: magnetic storage media, 
such as floppy discs, hard disc storage medium, and magnetic tape: optical storage media such as optical discs or 

-ts CD-ROM: electrical storage media such as RAM and ROM; and hybrids of these categories such as magnetic/optical 
storage media. A skilled artisan can readily appreciate how any of the presently known computer readable mediums 
can be used to create a manufacture comprising computer readable medium having recorded thereon an amino acid 
and/or nucleotide sequence. X-ray diffraction data, and/or atomic co-ordinates of the present invention. 
[0259] As used herein, the term "recorded" is understood to mean any process for storing information on computer 

so readable medium. A skilled artisan can readily adopt any of the presently known methods for recording information on 
computer readable medium to generate manufactures comprising an amino acid or nucleotide sequence, atomic co- 
ordinates and/or X-ray diffraction data of the present invention. 

[0260] A variety of data storage structures are available to a skilled artisan for creating a computer readable medium 
having recorded thereon amino acid and/or nucleotide sequence, atomic co-ordinates and/or X-ray diffraction data of 
55 the present invention. The choice of the data storage structure will generally be based on the means chosen to access 
the stored information. In addition, a variety of data processor programs and formats can be used to store the sequence 
information. X-ray data and/or atomic co-ordinates of the present invention on computer readable medium. The fore- 
going information, data and co-ordinates can be represented in a word processing text file, formatted in commercially- 
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available software such as WordPerfect and MICROSOFT Word, or represented in the form of an ASCII file, stored in 
a database application, such as DB2. Sybase. Oracle, or the like. A skilled artisan can readily adapt any number of 
data processor structuring formats (e.g. text file or database) in order to obtain computer readable medium having 
recorded thereon the information of the present invention. 

5 [0261] By providing a computer readable medium having stored thereon a ribosome or ribosomal subunit sequence, 
and/or atomic co-ordinates, a skilled artisan can routinely access the sequence, and/or atomic co-ordinates to model 
a ribosome or ribosomal subunit. a subdomain thereof, mimetic, or a ligand thereof. Computer algorithms are publicly 
and commercially available which allow a skilled artisan to access this data provided in a computer readable medium 
and analyze it for molecular modeling and/or RDD. See. e.g., Biotechnology Software Directory . MaryAnn Liebert Publ. , 

10 New York : NY (1995). 

[0262] Although computers are not required, molecular modeling can be most readily facilitated by using computers 
to build realistic models of a ribosome, ribosomal subunit, or a portion thereof. Molecular modeling also permits the 
modeling of new smaller molecules, for example ligands. agents and other molecules, that can bind to a ribosome, 
ribosomal subunit. or a portion therein. The methods utilized in molecular modeling range from molecular graphics (/' 
*5 a, three-dimensional representations) to computational chemistry (i.e., calculations of the physical and chemical prop- 
erties) to make predictions about the binding of the smaller molecules or their activities; to design new molecules; and 
to predict novel molecules, including ligands such as drugs, for chemical synthesis. 

[0263] For basic information on molecular modeling, see, for example. M. Schlecht, Molecular Modeling on the PC 
(1 998) John Wiley & Sons: Gans et ai, Fundamental Principals of Molecular Modeling (1 996) Plenum Pub. Corp.; N. 
20 C. Cohen, ed. . Guidebook on Molecular Modeling in Drug Design (1 996) Academic Press; and W.B. Smith, Introduction 
to Theoretical Organic Chemistry and Molecular Modeling (1996). U.S. Patents which provide detailed information on 
molecular modeling include, for example: U.S. Patent Nos. 6,093,573; 6,080,576; 6.075,014; 6,075,123; 6,071700; 
5,994 ; 503: 5,884.230; 5,612.894: 5.583,973; 5.030,103; 4 ; 906,122; and 4,812,12. 

[0264] Three-dimensional modeling can include, but is not limited to, making three-dimensional representations of 

25 structures, drawing pictures of structures, building physical models of structures, and determining the structures of 
related ribosomes, ribosomal subunits and ribosome/ligand and ribosomal subunit/ligand complexes using the known 
co-ordinates. The appropriate co-ordinates are entered into one or more computer programs for molecular modeling, 
as known in the art. By way of illustration, a list of computer programs useful for viewing or manipulating three-dimen- 
sional structures include: Midas (University of California, San Francisco): MidasPlus (University of California, San 

30 Francisco): MOIL (University of Illinois); Yummie (Yale University): Sybyl (Tripos, Inc.); Insight/Discover (Biosym Tech- 
nologies): MacroModel (Columbia University); Quanta (Molecular Simulations. Inc.): Cerius (Molecular Simulations, 
Inc.): Alchemy (Tripos. Inc.): Lab Vision (Tripos. Inc.); Rasmol (Glaxo Research and Development): Ribbon (University 
of Alabama): NAOMI (Oxford University); Explorer Eyechem (Silicon Graphics. Inc.); Univision (Cray Research); Mols- 
cript (Uppsala University); Chem-3D (Cambridge Scientific): Chain (Baylor College of Medicine); O (Uppsala Univer- 

35 sity): GRASP (Columbia University); X-Ptor (Molecular Simulations. Inc.: Yale University); Spartan (Wavefunction, Inc.); 
Catalyst (Molecular Simulations, Inc.): Molcadd (Tripos. Inc.): VMD (University of lllinois/Beckman Institute); Sculpt 
(Interactive Simulations, Inc.): Procheck (Brookhaven National Library): DGEOM (QCPE): RE_VIEW (Brunell Univer- 
sity); Modeller (Birbeck College, University of London); Xmol (Minnesota Supercomputing Center); Protein Expert 
(Cambridge Scientific): HyperChem (Hypercube): MD Display (University of Washington); PKB (National Center for 

*o Biotechnology Information, NIH); ChemX (Chemical Design. Ltd.): Cameleon (Oxford Molecular, Inc.); and Iditis (Oxford 
Molecular. Inc.). 

[0265] One approach to RDD is to search for known molecular structures that might bind to a site of interest. Using 
molecular modeling. RDD programs can look at a range of different molecular structures of molecules that may fit into 
a site of interest, and by moving them on the computer screen or via computation it can be decided which structures 
•*5 actually fit the site well (William Bains (1998) Biotechnology from A to Z. second edition. Oxford University Press, p. 
259). 

[0266] An alternative but related approach starts with the known structure of a complex with a small molecule tigand 
and models modifications of that small molecule in an effort to make additional favorable interactions with a ribosome 
or ribosomal subunit. 

so [0267] The present invention permits the use of molecular and computer modeling techniques to design and select 
novel molecules, such as antibiotics or other therapeutic agents, that interact with ribosomes and ribosomal subunits. 
Such antibiotics and other types of therapeutic agents include, but arc not limited to, antifungals, antivirals, antibactc- 
rials. insecticides, herbicides, miticides, rodentcides. etc. 

[0268] In order to facilitate molecular modeling and/or RDD the skilled artisan may use some or all of the atomic co- 
55 ordinates deposited at the RCSB Protein Data Bank with the accession numbers PDB ID: 1 FFK, 1JJ2. 1 FF2, or IFG0, 
and/or those atomic co-ordinates contained on Disk No 1, 2 or 3 of 3. Furthermore, the skilled artisan, using the 
foregoing atomic co-ordinates, the skilled artisan can generate additional atomic co-ordinates via. for example, mo- 
lecular modeling using, for example, homology modeling and/or molecular replacement techniques, that together define 
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at least a portion of a model of a ribosome from another species of interest. By using the foregoing atomic co-ordinates, 
the skilled artisan can design inhibitors of protein synthesis that may be tailored to be effective against ribosomes from 
one or more species but which have little or no effect on ribosomes of other species Such inhibitors may be competitive 
inhibitors. As used herein . the term "competitive inhibitor" refers to an inhibitor that binds to the active form of a ribosome 
5 or ribosomal subunit at the same sites as its substrate(s) or tRNA(s). thus directly competing with them. Tne term 
"active form" of a ribosome or ribosomal subunit refers to a ribosome or ribosomal subunit in a state that renders it 
capable of protein synthesis. Competitive inhibition can be reversed completely by increasing the substrate or tRNA 
concentration. 

[0269] This invention also permits the design of molecules that act as uncompetitive inhibitors of protein synthesis. 

io As used herein, the term "uncompetitive inhibitor refers to a molecule that inhibits the functional activity of a ribosome 
or ribosomal subunit by binding to a different site on the ribosome or ribosomal subunit than does its substrates, or 
tRNA. Such inhibitors can often bind to the ribosome or ribosomal subunit with the substrate or tRNA and not to the 
ribosome or ribosomal subunit by itself. Uncompetitive inhibition cannot be reversed completely by increasing the 
substrate concentration. These inhibitors may bind to. all or a portion of, the active sites or other regions of the large 

15 ribosomal subunit already bound to its substrate and may be more potent and less non-specific than known competitive 
inhibitors that compete for large ribosomal subunit active sites or for binding to large ribosomal subunit. 
[0270] Similarly, non-competitive inhibitors that bind to and inhibit protein synthesis whether or not it is bound to 
another chemical entity may be designed using the atomic co-ordinates of the large ribosomal subunits or complexes 
comprising large ribosomal subunit of Ihis invenlion. As used herein, the term "non-competitive inhibitor" refers to an 

20 inhibrtoi that can bind to either the free or substrate or tRNA bound form of the ribosome or ribosomal subunit. 

[0271] Those of skill in the art may identify inhibitors as competitive, uncompetitive, or non-competitive by computer 
fitting enzyme kinetic data using standard equation according to Segel. I.H, (1975) Enzyme Kinetics: Behaviour and 
Analysis of Rapid Equilibrium and Steady-State Enzyme Systems . (Wiley Classics Library). It should also be understood 
that uncompetitive or non-compctitivc inhibitors according to the present invention may bind the same or different 

25 binding sites. 

[0272] Alternatively, the atomic co-ordinates provided by the present invention are useful in designing improved 
analogues of known protein synthesis inhibitors or to design novel classes of inhibitors based on the atomic structures 
and co-ordinates of the crystals of the 50S ribosomal subunit/CCdA-p-Puro complex and the SOS ribosomal subunit/ 
aa-tRNA analogue complex. This provides a novel route for designing inhibitors of protein synthesis with both high 

30 specificity, stability and other drug-like qualities (Lipinski el al. (1997) Adv. Drug Deliv. Rev. 23:3). 

[0273] The atomic co-ordinates of the present invention also permit probing the three-dimensional structure of a 
ribosome or ribosome subunit or a portion thereof with molecules composed of a variety of different chemical features 
to determine optimal sites for interaction between candidate inhibitors and/or activators and the ribosome or ribosomal 
subunit. For example, high resolution atomic co-ordinates based on X-ray diffraction data collected from crystals sat- 

35 urated with solvent allows the determination of where each type of solvent molecule sticks. Small molecules that bind 
to those sites can then be designed and synthesized and tested for their inhibitory activity (Travis. J. (1993) Science 
262: 1374). Further, any known antibiotic, inhibitor or other small molecule that binds to the H. marismortui large subunit 
can be soaked into H. marismortui large subunit crystals and their exact mode of binding determined from difference 
electron density maps. These molecules may represent lead compounds from which better drug-like compounds can 

-to be synthesized. 

b. Identification of Target Sites. 

[0274] The atomic co-ordinates of the invention permit the skilled artisan to identify target locations in a ribosome or 
•*5 large ribosomal subunit that can serve as a starting point in rational drug design. As a threshold matter, the atomic co- 
ordinates of. the invention permit the skilled artisan to identify specific regions within a ribosome or ribosomal subunit 
that are involved with protein synthesis and/or protein secretion oul of the ribosome. Furthermore, the atomic co- 
ordinates of the invention permit a skilled artisan to further identify portions of these regions that are conserved or are 
not conserved between different organisms. For example, by identifying portions of these regions that are conserved 
50 among certain pathogens, for example, certain prokaryotes. but are not conserved in a host organism, for example, a 
eukaryote. more preferably a mammal, the skilled artisan can design molecules that selectively inhibit or disrupt protein 
synthesis activity of the pathogen's but not the host's ribosomes. Furthermore, by analyzing regions that arc cither 
conserved or non-conserved between certain pathogens, it may be possible to design broad or narrow spectrum protein 
synthesis inhibitors, e.g.. antibiotics, as a particular necessity arises. 
55 [0275] Figure 28. is a schematic representation of a large ribosomal subunit that identifies a variety of exemplary 
target sites that appear to participate in protein synthesis within the ribosome and/or the export or translocation of the 
newly synthesized protein out of the ribosome The target sites include, for example, the P-site (200) the A-site (201 ), 
the peptidyl transferase center (202). the peptidyl transferase site (203) which includes at least a portion of the P-site 
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and the A-site a factor binding domain (204) including, for example, the EF-Tu binding domain and the EF-G binding 
domain, the polypeptide exit tunnel (205) including cavities defined by the wall of the exit tunnel, and the signal rec- 
ognition particle binding domain (206). 

[0276] By way of example, inspection of the atomic co-ordinates of the H. mahsmortui SOS ribosomal subunit has 
5 identified a variety of target regions that may serve as a basis for the rational drug design of new or modified protein 
synthesis inhibitors. The target regions include the peptidyl transferase site. A-site ; the P-site, the polypeptide exit 
tunnel, certain cavities disposed in the wall of the polypeptide exit tunnel (for example, cavity 1 and cavity 2). and 
certain antibiotic binding pockets. The residues that together define at least a portion of each of the foregoing regions 
are identified in the following tables. However, it is contemplated that the same or similar target sites can be identified 
10 in a ribosome or a ribosomal unit of interest using the principles described herein. Furthermore, these principles can 
be employed using any of the primary sets of atomic co-ordinates provided herein or any additional atomic co-ordinate 
sets ; for example, secondary atomic co-ordinate sets that may be generated by molecular modeling of any ribosome 
or ribosomal subunit of interest. 

[0277] Table 5 identifies the residues in the H. mahsmortui 50S ribosomal subunit that together define at least a 
*5 portion of the ribosomal peptidyl transferase site. In addition, Table 5 identities which of those residues that define at 
least a portion of the peptidyl transferase site are not conserved between H. mahsmortui and E. colt, those that are 
not conserved between H. mahsmortui and rat. those that are not conserved between E. coti and rat. and those that 
are not conserved between eubacteria and eukaryota. The non-conserved residues were identified by comparison of 
sequences of H. mahsmortui 23S rRNA that form the above-mentioned sites with the corresponding sequences of 
20 aligned rRNA from the other organisms. 



Table 5 





Residues that Define the Ribosomal Peptidyl-Transferase Site 




H. marismortui A- 
site Residues 


Residues Not 
Conserved 
between H. 
marismortui and E. 
coli 


Residues Not 
Conserved 
Between H. 
marismortui and 
Rattus 


Residues Not 
Conserved 
Between E. coli 
and Rattus (Coli/ 
Rat) 


Residue Conserved 
Between Bacteria 
and Eukaryotes? 


30 


G2102 








Yes 




A2103 








Yes 




C2104 








Yes 


35 


C2105 








Yes 


C2106 








Yes 




G2482 








Yes 




G2284 








Yes 


40 


G2285 








Yes 




G2286 








Yes 




A2474 








Yes 


45 


A2485 








Yes 




A2486 








Yes 




C2487 








Yes 




A2488 




U 


A/U 


No 


50 


U2528 








Yes 




C2536 








Yes 




A2538 








Yes 


55 


G2540 








Yes 




U2541 








Yes 




C2542 








Yes 



40 
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TablcS (continued) 



Residues that Define the Ribosomal Peptidyl-Transferase Site 


H. marismortui A- 
site Residues 


Residues Not 
Conserved 
between H. 
marismortui and E. 
coli 


Residues Not 
Conserved 
Between H. 
marismortui and 
Rattus 


Residues Not 
Conserved 
Between E. coti 
and Rattus (Coli/ 
Rat) 


Residue Conserved 
Dei ween tsaciena 
and Eukaryotes? 


G2543 








Yes 


G2588 








Yes 


U2589 








Yes 


U2590 








Yes 


C2608 








Yes 


G2617 








Yes 


G2618 








Yes 


U2619 








Yes 


U2620 








Yes 


A2635 








Yes 


C2636 








Yes 


A2637 








Yes 


G2638 








Yes 


Residues were determined by a 5.8 angstrom distance measurement between the atoms of the CCdA-p-Puromycin 
A-site ligand and CCdA-Puromycin transition state inhibitor (PDB accession codes: 1fg0 and 1ffz. respectively) and 
the SOS ribosome using the program SPOCK a . Conserved residues were determined by comparison between the 
proposed secondary structures of H. marismortui. E. colfc, and Ratios norvegicus 0 . 



a) Christopher. Jon A. (1998) SPOCK: The Structural Properties Observation and Calculation Kit (Program Manual). The Center tor Macromolecular 
Design, Texas ASM University, College Station. TX. 

b) Comparative RNA Web Site URL http://www.ma.icmb.utexas.edu/ 



^ 5 c) Wool. I.G. (1985). "Studies of the Structure ot Eukarvotic (Mammalian) Ribosomes" in Structure, Function, and Genetics ol Ribosomes . supra. 

pp. 391-411. 



[0278] Table 6 identifies the residues in the H. marismortui SOS ribosomal subunit that together define at least a 
portion of the ribosomal A-site. In addition, Table 6 identifies which of those residues that define at least a portion the 
40 A-site are not conserved between H. marismortui and E. coli. those that are not conserved between H. marismortui 
and rat. those that are not conserved between E. coli and rat. and those that are not conserved between eubactena 
and eukaryota. The non conserved residues were identified as described previously with respect to Table 5. 



Table 6 



45 


Residues that Define the Ribosomal A-site 


50 


H. marismortui A- 
site Residues 


Residues Not 
Conserved 
between H. 
marismortui and 
E. coli 


Residues Not 
Conserved 
Between H. 
marismortui and 
Rattus 


Residues Not 
Conserved 
Between E. coli 
and Rattus (Coli/ 
Rat) 


ResidueConserved 
Between Bacteria 
and Eukaryotes? 




G2102 








Yes 




A2103 








Yes 


55 


C2104 








Yes 




G2482 








Yes 
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Table 6 (continued) 





Residues that Define the Ribosomal A-site 




H. marismortui A- 


Residues Not 


Residues Not 


Residues Not 


ResidueConserv d 


5 


site Residues 


Conserved 


Conserved 


C n served 


Between Bacteria 






between H. 


Between H. 


Between E. coli 


and Eukaryotes? 






marismortui and 


marismortui and 


and Rattus (Coli/ 








E. coli 


Rattus 


Rat) 




10 


A2485 








Yes 


A2486 








Yes 




C2487 








Yes 




A2488 




U 


A/U 


No 


15 


U2528 








Yes 




C2536 








Yes 




A2538 








Yes 


20 


G2540 








Yes 




U2541 








Yes 




C2542 








Yes 




G2543 








Yes 


25 


G2588 








Yes 




U2589 








Yes 




U2590 








Yes 


30 


C2608 








Yes 




G2617 








Yes 




G2618 








Yes 




U2619 








Yes 


35 


U2620 








Yes 




A2637 








Yes 




Residues were determined by a 5.8 angstrom distance measurement between the atoms of the CC-Puromycin A- 




site ligand and the SOS ribosome {PDB accession code 1fg0) using the program SPOCK 8 . 


Conserved residues 


40 


were determined by comparison between the proposed secondary structures of H. mahsmortuP, E. colP, and Rattus 




norvegicusfi. 











e) Christopher. Jon A. 0998) SPOCK: The Structural Properties Observation and Calculation Kit (Program Manual), The Center for Macromolecular 
Design. Texas ASM University. College Station. TX. 

b) Comparative RNA Web Site URL http://www.rna.tcmb.ulexas.edu/ 

c) Woo). I.G (1985). "Studies of the Structure of Eukaryotic (Mammalian) Ribosomes" in Structure. Function, and Genetics of Ribosomes . supra, 
pp 391-411. 



[0279] Table 7 identifies the residues in the H. marismortui SOS ribosomal subunit that together define at least a 
portion of the ribosomal P-site. As demonstrated in Table 7. all of the residues in that portion of the ribosomal P-sites 
are conserved between H. marismortui and E. coli, between H. marismortui and rat. between E. coli and rat. and 
between eubacleria and eukaryola. as determined using the comparison method described previously with respect to 
Table 5. 
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Table 7 





Residues that Define the Ribosomal P-site 


5 


H. msrismortui P- 
site Residues 


Residues Not 
Conserved 
between H. 

1 1 Id 1 lol 1 IU l LU 1 ailU 

E. coli 


Residues Not 
Conserved 
Between H. 

mnrlcmortul snd 

Rattus 


Residues N t 
Conserved 
Between E. cofiand 
Rattus fColl/Ratl 


R sidue Conserved 
Between Bacteria 
and Eukaryotes? 


10 


C2104 








Yes 




C2105 








Yes 




C2106 








Yes 


15 


G2284 








Yes 


G2285 








Yes 




G2286 








Yes 




A2474 








Yes 


20 


A2485 








Yes 




A2486 








Yes 




U2619 








Yes 


25 


U2620 








Yes 




A2635 








Yes 




C2636 








Yes 




A2637 








Yes 


30 


G2638 








Yes 


35 


Residues were determined by a 5.8 angstrom distance measurement between the atoms of the CCA-P0 2 moiety 
of the CCdA-p-puromycin transition state inhibitor and the SOS ribosome (PDB accession code tffz) using the 
program SPOCK 3 . Conserved residues were determined by comparison between the proposed secondary 
structures of H. marismortui, E colP. and Rattus norvegicusP. 



a) Christopher. Jon A. (1998) SPOCK: The Structural Properties Observation and Calculation Kit (Program Manual). The Center for Macromolecular 
Design, Texas ASM University. College Station. TX. 

b) Comparative P.NA Web Site URL http://www.rna.icmb.utexas.edu/ 

c) Wool. I.G. (1985). "Studies of the Structure of Eukaryotic (Mammalian) Rlbosomes'' in Structure. Function, and Genetics of Rlbosomes . supra, 
pp 391-411 

[0280] Table 8 identifies the residues in the H. marismortui SOS ribosomal subunit that together define at least a 
portion of the ribosomal polypeptide exit tunnel. In addition, Table 8 identifies which of those residues that define at 
least a portion of the polypeptide exit tunnel are not conserved between H. marismortui and E. coli, those that are not 
conserved between H. marismortui and rat. those that are not conserved between E. coli and rat. and those that are 
not conserved between eubacteria and eukaryota. The non conserved residues were identified as described previously 
with respect to Table 5. 
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Table 8 





Residues that Define the Ribosomal Peptide Exit Tunnel 


5 


n. marismonui a- 


P o i f4 1 lac Mnt 


Daci/4iiac Mot 


n aiuuca i%ui 


Residue 




site Residues 


Conserved 


Conserved 


Conserved 


Conserved 






between H. 


Between H. 


Between £. coti 


Between Bacteria 






marlsmortul and E. 


marlsmortul and 


ana nanus (1*011/ 








coli 


Rattus 


Rat) 




JO 


23S rRNA 












G23 








Yes 




G24 








Yes 


15 


A60 








Yes 


GB8 








Yes 




G89 








Yes 




A90 








Yes 


20 


U454 








Yes 




A462 


C 




C/A 


No 




A466 








Yes 


25 


G467 


c 


A 


C/A 


No 




U468 


c 


C 




Yes 




G469 








Yes 




A476 








Yes 


30 


A477 








Yes 




C478 


A 


A 




Yes 




G487 








Yes 


35 


U488 


A 


A 




Yes 




A489 








Yes 




C490 








Yes 




C491 




U 


C/U 


No 


40 


C492 




u 


C/U 


No 




A497 








Yes 




A498 




G 


A/G 


No 


45 


G499 




A 


G/A 


No 




G500 












G501 




A 


G/A 


No 




A513 




G 


A/G 


No 


SO 


G514 


A 




A/G 


No 




G636 




A 


G/A 


No 




A767 








Yes 


55 


U768 


A 


A 




Yes 




U835 


del 




det/U 


No 
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Tabic 8 (continued) 







Residues that Define the Ribosomal Peptide Exit Tunnel 





5 


H. marismortui A- 
sue nesiuueb 


Residues Not 
Conserved 
between H. 
marismortui and E. 
coli 


Residues Not 
Conserved 
Between H. 
marismortui and 
Rattus 


Residues Not 
Conserved 
Between E. coli 
and Rattus (Coli/ 
Rat) 


Residue 
Conserved 
Between Bacteria 
and Eukaryotes? 


10 


23S rRNA 












C839 


U 


G 


U/G 


No 




U840 








Yes 




A841 


G 


G 




Yes 


1 D 


A 844 








Yes 




U845 


A 


A 




Yes 




C879 








Yes 


20 


A882 








Yes 




U883 








Yes 




C884 








Yes 




G885 


A 


C 


A/C 


No 


25 


U888 


C 


C 




Yes 




C889 








Yes 




C890 


G 


A 


G/A 


No 


30 


U1359 








Yes 


C1360 


G 




G/C 


No 




C1361 




U 


C/U 


No 




U1362 








Yes 


35 


G1363 








Yes 




G1364 


A 




A/G 


No 




C1365 




U 


C/U 


No 


40 


C1366 


A 


G 


A/G 


No 




A1367 


U 


G 


U/G 


No 




A 1424 


U 


G 


U/G 


No 




G1425 


c 


U 


C/U 


No 


45 


C1426 




G 


C/G 


No 




A 1427 








Yes 




C1428 


A 

A 


A 




Yes 


50 


U1429 


C 


c 




Yes 




G1430 




A 


G/A 


NO 




C1439 


G 


A 


G/A 


No * 




U1440 


G 


C 


G/C 


No 


55 


G1441 


C 


A 


C/A 


No 




A 1442 




U 


A/U 


No 



45 
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Table 8 (continued) 





Residues that Define the Ribosomal Peptide Exit Tunnel 


5 


H. marism rtui A- 
site Residues 


Residues Not 
Conserv d 
between H 
maris mortui and E. 
coli 


Residues Not 
Conserv d 

Rpluuopn H 
□ciwvvli r i . 

marismortui and 
Rattus 


Residues Not 
Conserved 
Between E. coli 
and Rattus (Coli/ 
Rat) 


R sidue 
Cons rved 
Between Bacteria 
and Eukaryotes? 


10 


23S rRNA 










A 1689 




C 


A/C 


NO 




C1690 




A 


C/A 


No 




A1691 




G 


A/G 


No 


15 


A 1836 








Yes 




G1837 


U 


A 


U/A 


No 




U1838 








Yes 


20 


A2054 








Yes 




A2055 








Yes 




C2056 


A 


A 




Yes 




U2057 




C 


U/C 


No 


25 


G2058 


U 


C 


u/c 


NO 




C2098 


G 


A 


G/A 


No 




G2099 


A 




A/G 


No 


30 


A2100 








Yes 




A2101 








Yes 




G2102 








Yes 




A2103 








Yes 


jo 


C2104 








Yes 




C2105 








Yes 




A2474 








Yes 


40 


C2475 








Yes 




C2476 


U 




U/C 


No 




C2477 




A 


C/A 


No 


45 


U2478 


C 


C 




Yes 


G2482 








Yes 




A2485 








Yes 




A2486 








Yes 


50 


C2487 








Yes 




A2488 




U 


A/U 


NO 




C2536 








Yes 


55 


A2538 








Yes 


G2540 








Yes 




U2541 








Yes 
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Tabic 8 (continued) 



Residues that Define the Ribosomal Peptide Exit Tunnel 


H. marismortui A- 
site Residues 

23S rRNA 


Residues Not 
Conserved 
between H. 
marismortui and E. 
coli 


Residues Not 
Conserved 
Between H. 
marismortui and 
Rattus 


Residues Not 
Conserved 
Between E. coli 
and Rattus (Coli/ 
Rat) 


Residue 

1/DnaciVcU 

Between Bacteria 
and Eukaryotes? 


C2542 








Yes 


C2608 








Yes 


G2611 








Yes 


G2616 








Yes 


G2618 








Yes 


U2619 








Yes 


U2620 








Yes 


U2621 








Yes 


A2637 








Yes 


G2643 








Yes 


C2644 


U 


U 




Yes 


U2645 


c 




C/U 


No 


G2646 


c 


U 


c/u 


No 


C2647 




u 


C/U 


No 












Protein L4 










E59 








Yes 


S60 


V 


s 


v/s 


No 


F61 


T 


w 


T/W 


No 


G62 








Yes 


S63 




T 


S/T 


No 


G64 








Yes 


R65 


K 




K/R 


No 


G66 


DEL 


A 


DEL/A 


NO 


Q67 


DEL 


V 


DEL/V 


No 


A68 


DEL 




DEL/A 


No 


H69 


K 


□ 
n 


rv n 


No 


V70 


P 


I 


p/l 


No 


P71 


W 


P 


W/P 


No 


K72 


R 


R 




Yes 


L73 


K 


R 


K/R 


No 


D74 


G 


G 




Yes 


G75 


T 


G 


T/G 


No 
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Tabic 8 (continued) 





Residues that Define the Ribosomal Peptide Exit Tunnel 




H. marismortui A- 


Residues Not 


Residues Not 


Residues Not 


Residue 


5 


site Residues 


Conserved 


Conserved 


Conserved 


Conserved 






between H 


Between H 


Between E. coli 


Between Bacteria 






marismortui and E. 


marismortui and 


and Rattus (Coli/ 


and Eukaryotes? 






coli 


Rattus 


Rat) 




10 


Protein L4 










R76 


G 


G 




Yes 




A77 


R 


T 


R/T 


No 














15 


Protein L22 












E20 


H 


N 


H/N 


No 




E121 


S 


K 


S/K 


No 


20 


Q122 


M 


M 




Yes 




Q123 


K 


R 


K/R 


No 




G124 


R 


R 




Yes 




R125 


I 


R 


l/R 


No 


25 


K126 


M 


T 


M/T 


No 




P 127 




Y 


P/Y 


No 




R128 








Yes 


30 


A129 








Yes 




M130 


K 


H 


K/H 


No 




G131 








Yes 




R132 








Yes 


35 


A133 




1 


A/I 


No 




S134 


D 


N 


D/N 


No 




A135 


R 


P 


R/P 


No 


40 


W136 


I 


Y 


l/Y 


No 




N137 


L 


M 


L/M 


No 




Q140 


T 


P 


T/P 


No 














45 


Protein L39E ** 












N18 












S19 










50 


R20 












V21 












P22 












A23 











"Given that homologues for the H. marismortui ribosomal protein L39E are not currently known, sequence comparisons with E. coli. and Rattus 
norvegicus are not possible. 
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Tabic 8 (continued) 





Residues that Define the Ribosomal Peptide Exit Tunnel 




H. marismortui A- 


Residu s Not 


Residues Not 


Residues Not 


Residue 


5 


site Residues 


Conserved 


Conserved 


Conserved 


Conserved 






between H. 


Between H. 


Between E. coli 


Between Bacteria 






marismortui and E. 


marismortui and 


and Rattus (Coli/ 


and Eukaryotes? 






coli 


Rattus 


Rat) 




10 


Protein L39E ** 












Y24 












V25 












M26 










15 


L27 












K28 












T29 










20 


E31 










R35 












N36 












H37 










25 


K38 












R39 












R40 










30 


H41 












R44 












N45 












Residues were determined by a 10 angstrom distance measurement between the atoms of a model of a newly 


35 


synthesized peptide positioned in the center of the exit tunnel and the SOS ribosome using the program SPOCK a . 
Conserved residues were determined by comparison between the proposed secondary structures of H. 




marismortui, E. coiP, and Rattus norvegicus 0 







"Given that homoloques tor the H. marismortui ribosomal protein L39E are not currently known, sequence comparisons with E. coli, and Rattus 



norvegicus are not possible. 

40 a) Christopher, Jon A. (1998) SPOCK: The Structural Properties Observation and Calculation Kit (Program Manual). The Center for Macromolecular 

Design, Texas A&M University. College Station, TX. 

b) Comparative RNA Web Site URL http://www.rna.icmb.uteKas.edu/ 

c) Wool, I.G. (1985). "Studies ol the Structure of Eukaryotic (Mammalian) Ribosomes" in Structure, Function, and Genetics of Ribosomes . supra, 
pp. 391-411. 

45 

[0281] Figure 26 shows a region of the large ribosomal subunit in which an antibiotic binds. Figure 26(A) shows an 
enlarged portion of the large ribosomal subunit with the antibiotic tylosin bound at the top of the polypeptide exit tunnel 
adjacent the peptidyl transferase site. Figures 26(B) and 26(C) are views showing each half of a large ribosomal subunit 
cut along the polypeptide exit tunnel and are provided to orient the reader to show the tylosin binding site relative to 
the large ribosomal unit as a whole. Figure 26(A) also shows two cavities defined by the wall of the polypeptide exit 
tunnel and are denoted as "cavity 1 " and "cavity 2." In addition, Figure 26(A) also shows a disaccharide binding pocket. 
The direction in which the newly synthesized polypeptide chains exits the ribosome through the polypeptide exit tunnel 
is denoted by an arrow. 

[0282] Table 9 identifies the residues in the H. marismortui SOS ribosomal subunit that together define a first cavity 
within the wall of polypeptide exit tunnel (cavity 1). In addition, Table 9 identifies which of those residues that define 
cavity 1 are not conserved between H. marismortui and E. coli, those that are not conserved between H. marismortui 
and rat. those that are not conserved between E. co// and rat. and those that are not conserved between eubacteria 
and eukaryota. The non-conserved residues were identified as described previously with respect to Table 5. 
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Table 9 





Residues that Define Cavity 1 in the Ribosomal Peptide Exit Tunnel 




5 


H. marismortui 


riesiuues Not 


KeSluU S riOl 


ncoluu o Paul 


Residu Conserved 




Residues 


Conserved 


Conserved 


Conserved 


Between Bacteria 






between H. 


Between H. 


Between E. coti 


and Eukaryotes? 






marismortui and 


marismortui and 


ana nanus (uoii/ 








E. coli 


Rattus 


Rat) 




10 














C474 


G 


G 




Yes 




A766 








Yes 


15 


A767 








Yes 


U768 


A 


C 


A/C 


No 




U883 








Yes 




C884 








Yes 


20 


G885 


A 




A/G 


No 




A886 








Yes 




U888 


C 


C 




Yes 


25 


C889 








Yes 


C890 


G 


A 


G/A 


No 




U1359 








Yes 




G1837 


U 


A 


U/A 


No 


30 


A2100 








Yes 




A2101 








Yes 




G2102 








Yes 


35 


A2103 








Yes 




C2475 








Yes 




C2476 


u 




U/C 


No 




C2477 




A 


C/A 


No 


40 


U2478 


c 


C 




Yes 




A2479 


G 




G/A 


No 




A2538 








Yes 


45 














Protein L4 












P57 


R 


S 


R/S 


No 




A58 








Yes 


50 


E59 








Yes 




S60 


V 




V/S 


No 




F61 


T 


W 


T/W 


No 


55 


G62 








Yes 




S63 




T 


S/T 


No 
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Tabic 9 (continued) 



Residues that Define Cavity 1 in the Ribosomal Peptide Exit Tunnel 


H. marismortui 
Residues 


Residues Not 
Conserved 
between H. 
marismortui and 
E. coli 


Residues Not 
Conserved 
Between H. 
marismortui and 
Rattus 


Residues Not 
Conserved 
Between E. coli 
and Rattus (Coli/ 
Rat) 


Residue Conserved 
Dei ween Datici m 
and Eukaryotes? 


Protein L4 










G64 








Yes 


R65 


K 




K/R 


No 


Q67 


DEL 


V 


DEL/V 


No 


V70 


P 


I 


P/l 


No 


P71 


W 




W/P 


No 


K72 


R 


R 




Yes 


L73 


K 


R 


K/R 


No 


D74 


G 


G 




Yes 


G75 


T 




T/G 


No 


R76 


G 


G 




Yes 












Cavity residues were identified using the program SPOCK a . Conserved residues were determined by comparison 
between the proposed secondary structures of H. marismortui, E. colP, and Rattus norvegicu&. 



a) Chnslopher. Jon A (1P98) SPOCK The Structural Properties Observation and Calculation Kit (Program Manual), The Center for Macromolecular 



i 20 Design. Texas ASM University. College Station. TX. 

' b) Comparative RNA Web Site URL http://www ma.icmb.utexas.edu/ 

' c) Wool. I G. (1985). "Studies ot the Structure of Eukaryotic (Mammalian) Ribosomes" in Structure. Function, and Genetics ot Ribo somes, supra, 

j pp. 391-411. 

j 

35 [0283] Table 10 identifies the residues in the H. marismortui SOS ribosomal subunit that together define a second 
cavity in the wall of polypeptide exit tunnel (cavity 2). In addition, Table 1 0 identifies which of those residues that define 
cavity 2 are not conserved between H. marismortui and E. coli. those that are not conserved between H. marismortui 
and rat. those that ate not conserved between E. coli and tat. and those that are not conserved between eubactena 
! and eukaryota. The non conserved residues were identified as described previously with respect to Table 5. 



Table 10 



Residues that Define Cavity 2 in the Ribosomal Peptide Exit Tunnel 


H. marismortui 
Residues 


Residues Not 
Conserved 
between H. 
marismortui and 
E. coli 


Residues Not 
Conserved 
Between H. 
marismortui and 
Rattus 


Residues Not 
Conserved 
Between E. coli 
and Rattus (Coli/ 
Rat) 


Residue Conserv d 
Between Bacteria 
and Eukaryotes? 












UB31 


C 


G 


C/G 


No 


U832 


DEL 




DEL/U 


No 


G833 


U 


C 


U/C 


No 


G834 


A 


C 


A/C 


No 


U835 


DEL 




DEL/U 


No 


G836 


A 




A/G 


No 
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Tabic 10 (continued) 





Residues that Define Cavity 2 in the Ribosomal Peptide Exit Tunnel 


5 


H. marismortui 
Residues 


Residues Not 
Conserv d 
between H. 
marismortui and 
E. coli 


Residues Not 
Conserved 
Between H. 
marismortui and 
Rattus 


Residues Not 
Conserved 
Between E. coli 
and Rattus (Coli/ 
Rat) 


Residue Conserved 
Between Bacteria 
and Eukaryotes? 


JO 


U837 




A 


U/A 


No 




C838 


G 




G/C 


No 




C839 


U 


G 


U/G 


No 




U840 








Yes 


75 


A841 


G 


G 




Yes 




A8A3 








Yes 




A844 








Yes 


20 


U845 


A 


A 




Yes 


A846 




U 


A/U 


No 




C847 


U 




u/c 


No 




C848 


U 


G 


U/G 


No 


25 


C849 


A 


G 


A/G 


No 




C1753 








Yes 




A1754 








Yes 


30 


G1837 


U 


A 


U/A 


No 




U1 838 








Yes 




A 1839 








Yes 




G2099 


A 




A/G 


No 


35 


A2100 








Yes 




A2103 








Yes 




U2615 








Yes 


10 


G2616 








Yes 




U2621 








Yes 




A2622 








Yes 




G2643 








Yes 


45 


C2644 


U 


U 




Yes 




U2645 


c 




c/u 


No 




G2646 


c 


U 


c/u 


No 


50 


C2647 




U 


c/u 


No 




Cavity residues were determined using the program SPOCK a . Conserved residues were determined by comparison 
between the proposed secondary structures of H. marismortui. E. coif 3 , and Rattus norvegicus*. 



a) Christopher. Jon A. (1998) SPOCK: The Structural Properties Observation and Calculation Kit (Program Manual), The Center lor Macromolecular 
Design. Texas A&M University. College Station. TX. 

b) Comparative RNA Web Site URL hllp.//www.rna. icmb.utexas.edu/ 

c) Wool. I.G. (1985). "Studies of the Structure of Eukaryotic (Mammalian) Ribosomes* in Structure. Function, and Genetics of Ribo somes, supra, 
pp. 391*411. 
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[0284] Tables 9 and 10. however, define only two or many cavities disposed within the wall of the polypeptide exit 
tunnel. However, by using the atomic co-ordinates and molecular modeling methodologies described herein, the skilled 
artisan may identify the residues (contributed by amino acids, nucleotides or a combination of both) that together define 
other cavities within the wall of the polypeptide exit tunnel. 

5 [0285] In addition, by using the atomic co-ordinates described herein, the skilled artisan can identify the antibiotic 
binding site of any antibiotic of interest. This information also provides contact sites between an antibiotic and the 
residues in a ribosome or ribosomal subunit, which can be used to advantage in the design of novel or modified protein 
synthesis inhibitors. The binding or contact sites for a variety of antibiotics are discussed in more detail below. 
[0286] Table 11 identifies the residues in the H. marismortui SOS ribosomal subunit that together define at least a 

w portion of an anisomycin binding pocket. In addition. Table 11 identifies which of tnose residues that define at least a 
portion of the anisomycin binding pocket are not conserved between H. marismortui and E. coli. those that are not 
conserved between H. marismortui and rat, those that are not conserved between E. coli and rat. and those that are 
not conserved between eubacteria and eukaryota. The non-conserved residues were identified as described previously 
with respect to Table 5. 

15 



Table 11 



Residues that Define the Anisomycin Binding Pocket 


H. marismortui 
Residues 


Residues Not 
Conserved 
between H. 
marismortui and E. 
coli 


Residues Not 
Conserved 
Between H. 
marismortui and 
Rattus 


Residues Not 
Conserved 
Between E. coli 
and Rattus (Colt/ 
Rat) 


ResidueConserved 
Between Bacteria 
and Eukaryotes? 


G2102 








Yes 


G2482 








Yes 


A2486 








Yes 


C2487 








Yes 


A2488 




U 


A/U 


No 


U2535 








Yes 


A2538 








Yes 


U2539 








Yes 


G2540 








Yes 


U2541 








Yes 


Residues were determined by a 5.8 angstrom distance measurement between the atoms of anisomystn and the 
SOS ribosome using the program SPOCK 3 . Conserved residues were determined by comparison between the 
proposed secondary structures of H. marismortuP. E. colP. and Rattus norvegicus*. 



a) Christopher, Jon A. (1998) SPOCK: The Structural Properties Observation and Calculation Kit (Program Manual), The Center tor Macromolecular 
Design. Texas ASM University. College Station, TX. 

b) Comparative RNA Web Site URL http//www rna icmb utexas edu/ 

c) Wool. I.G. (1985). "Studies of the Structure of Eukaryotic (Mammalian) Ribosomes" in Structure Function, and Genetics of Ribo somes, supra, 
pp. 391-411. 



[0287] Table 12 identifies the residues in the H. marismortui SOS ribosomal subunit that together define at least a 
portion of a blasticidin binding pocket. As demonstrated in Table 12, all of the residues in that portion of the blasticidm 
binding pocket are conserved between H. marismortui and E. coli, between H. marismortui and rat, between E. coli 
and rat. and between eubacteria and eukaryota, as determined using the comparison method described previously 
with respect to Table 5. 
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Table 12 





Residues that Define the Blasticiciin Binding Pocket 


5 


H mnricmnrttii 
n . ii icii iai i iui iu i 


Restdu s Not 


RpqiHups Not 


Residues Not 


Residue 




Residues 


Conserved 


Conserved 


Conserved 


Conserved 






between H. 


Between H. 


Between E. coli 


Between Bacteria 






marismortul and E. 


marlsmortui and 


onH Dottnc /r* o 11/ 
afiu rlallUS (vviu 


anH Fill/ aruntpc^ 
aiiu tunaiyuLca . 






coli 


Rattus 


Rat) 




10 














C2104 








Yes 




C2105 








Yes 


15 


C2106 








Yes 


G2284 








Yes 




G2285 








Yes 




U2473 








Yes 


20 


A2474 








Yes 




A2485 








Yes 




A2486 








Yes 


25 


U2620 








Yes 




G2634 








Yes 




A2635 








Yes 




C2636 








Yes 


30 


A2637 








Yes 




Residues were determined by a 5.8 angstrom distance measurement between the atoms of Blasticidin and the SOS 




ribosome using the program SPOCK 8 . Conserved residues were determined by comparison between the proposed 




secondary structures of H. marismortui . E. coif 3 , and Rattus norvegicus* . 





35 a) Christopher. Jon A. (1998) SPOCK: The Structural Properties Observation and Calculation Kit (Program Manual). The Center for Macromolecular 

Design, Texas ASM University, College Station. TX. 

b) Comparative RNA Web Site URL http://www.ma.icmb.utexas.edu/ 

c) Wool. I.G (1985). "Studies ot the Structure of Eukaryotic (Mammalian) Ribosomes" in Structure. Function, and Genetics ol Ribo somes, supra, 
pp. 391-411 



[0288] Table 13 identifies the residues in the H. marismortui SOS ribosomal subunit that together define at least a 
portion of a carbomycin binding pocket. In addition. Table 13 identifies which of those residues that define at least a 
portion of the carbomycin binding pocket are not conserved between K marismortui and £7. coli those that are not 
conserved between H. marismortui and rat, those that are not conserved between E. coli and rat. and those that are 
not conserved between eubacteria and eukaryota. The non-conserved residues were identified as described previously 
with respect to Table 5. 



Table 13 



Residues that Define the Carbomycin Binding Pocket 


H. marismortui 
Residues 


Residues Not 
Conserved 
between H. 
marismortui and E. 
coli 


Residues Not 
Conserved 
Between H. 
marismortui and 
Rattus 


Residues Not 
Conserved 
Between E. coli 
and Rattus (Coli/ 
Rat) 


Residue Conserved 
Between Bacteria 
and Eukaryotes? 












C839 


U 


G 


U/G 


No 
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Tabic 13 (continued) 



Residues that Define the Cartoomycin Binding Pocket 


H. marismortui 
Residues 


Residues Not 
Conserved 
between H. 
marismortui andE. 
coli 


Residues Not 
Conserved 
Between H. 
marismortui and 
Rattus 


Residues Not 
Conserved 
Between E. coli 
and Rattus (Coli/ 
Rat) 


Residue Conserved 

DclWccn DdCici la 

and Eukaryotes? 


G2099 


A 




A/G 


NO 


A2100 








Yes 


G2102 








Yes 


A2103 








Yes 


C2104 








Yes 


A2486 








Yes 


C2487 








Yes 


A2538 








Yes 


G2540 








Yes 


U2541 








Yes 


U2620 








Yes 


C2644 


U 


U 




Yes 


G2646 


C 


u 


C/U 


No 


Residues were determined by a 5.8 angstrom distance measurement between the atoms of carbomycin and the 
SOS ribosome using the program SPOCK 8 . Conserved residues were determined by comparison between the 
proposed secondary structures of H. marismortui, E. colt*, and Rattus norvegicus*. 



a) Christopher. Jon A. (1998) SPOCK: The Structural Properties Observation and Calculation Kil (Program Manual). The Center tor Macromolecular 
Design, Texas A&M University, College Station. TX. 

b) Comparative RNA Web Site URL hrtp7/www rna icmb utexns edu/ 

c) Wool. I.G. (1985). "Studies of the Structure of Eukaryotic (Mammalian) Ribosomes" in Structure. Function, and Genetics ol Ribosomes . supra. 



35 pp. 391-411. 

[0289] Table 14 identifies the residues in the H. marismortui SOS ribosomal subunit that together define at least a 
portion of a tylosin binding pocket. In addition. Table 1 4 identifies which of those residues that define at least a portion 
of the tylosin binding pocket are not conserved between H. marismortui and E. coli, those that are not conserved 
4 ° between H. marismortui and rat, those that are not conserved between E. coliand rat, and those that are not conserved 
between eubacteria and eukaryota. The non-conserved residues were identified as described previously with respect 
to Table 5. 



Table 14 



Residues that Define the Tylosin Binding Pocket 


H. marismortui 
Residues 


Residues Not 
Conserved 
between H. 
marismortui and E. 
coli 


Residues Not 
Conserved 
Between H. 
marismortui and 
Rattus 


Residues Not 
Conserved 
Between E. coli 
and Rattus (Coli/ 
Rat) 


Residue 
Conserved 
Between Bacteria 
and Eukaryotes? 












C839 


U 


G 


U/G 


No 


A841 


G 


G 




Yes 


A843 








Yes 
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Tabic 14 (continued) 





Residues that Define the Tylosin Binding Pocket 




H. marism rtui 


Residues Not 


Residues Not 


Residu s Not 


Residue 


5 


Residues 


Conserved 


C nserved 


Conserved 


Conserved 






between H. 


Between H. 


Between E. coli 


Between Bacteria 






marismortui and E. 


marismortui and 


and Rattus (Coli/ 


and Eukaryotes? 






coli 


Rattus 


Rat) 




10 


A844 








Yes 


U845 


A 


A 




Yes 




G1837 


U 


A 


U/A 


No 




C2098 


G 


A 


G/A 


No 


15 


G2099 


A 




A/G 


No 




A2100 








Yes 




G2102 








Yes 


20 


A2103 








Yes 




A2538 








Yes 




U2539 








Yes 




G2540 








Yes 


25 


U2541 








Yes 




U2645 


C 




C/U 


No 




G2646 


C 


U 


c/u 


No 


30 


Residues were determined by a 5.8 angstrom distance measurement between the atoms of tylosin and the SOS 


ribosome using the program SPOCK 3 . Conserved residues were determined by comparison between the proposed 




secondary structures ol H. marismortui. E. coli, and Rattus norvegicusF. 





a) Christopher, Jon A. (1998) SPOCK: The Structural Properties Observation and Calculation Kil (Program Manual), The Center lor Macromolecular 
Design. Texas ASM University. College Station, TX. 

b) Comparative RNA Web Site URL http://www.rna.icmb.utexas.edu/ 

c) Wool. I.G. (1985). "Studies ot the Structure of Eukaryotic (Mammalian) Ribosomes" in Structure. Function, and Genetics ol Ribosomes, supra. 
pp 391-411 



[0290] Tabic 15 identifies the residues in the H. marismortui SOS ribosomal subunit that together define at toast a 
portion of a sparsomycin binding pocket. As demonstrated in Table 15. all of the residues in that portion of the spar- 
somycin binding pocket are conserved between H. marismortui and E. coli, between H. marismortui and rat. between 
E. coli and rat. and between eubacteria and eukaryota. as determined using the comparison method described previ- 
ously with respect to Table 5. 



Table 15 



45 


Residues that Define the Sparsomycin Binding Pocket 


50 


H. marismortui 
Residues 


Residues Not 
Conserved 
between H. 
marismortui and E. 
coli 


Residues Not 
Conserved 
Between H. 
marismortui and 
Rattus 


Residues Not 
Conserved 
Between E. coli 
and Rattus (Coil/ 
Rat) 


Residue 
Conserved 
Between Bacteria 
and Eukaryotes? 














55 


A2486 








Yes 


C2487 








Yes 




U2541 








Yes 
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Tabic 15 (continued) 



Residues that Define the Sparsomycin Binding Pocket 


H. marismortui 
Residues 


Residues Not 
Conserved 
between H. 
marismortui and E. 
coli 


Residues Not 
Conserved 
Between H. 
marismortui and 
Rattus 


Residues Not 
Conserved 
Between E. coli 
and Rattus (Coli/ 
Rat) 


Residue 

n n e a r\f Arl 

uonservea 
Between Bacteria 
and Eukaryotes? 


C2608 








Yes 


U2619 








Yes 


U2620 








Yes 


C2636 








Yes 


A2637 








Yes 


Residues were determined by a 5.8 angstrom distance measurement between the atoms of sparsomycin and the 
50S ribosome using the program SPOCK 3 . Conserved residues were determined by comparison between the 
proposed secondary structures of H. marismortui, E. cotP, and Rattus norvegicus*. 



20 a) Christopher. Jon A. (1998) SPOCK. The Structural Properties Observation and Calculation Kit (Program Manual). The Center (or Macromolecular 

Design. Texas ASM University. College Station, TX 

b) Comparative RNA Web Site URL: http://www.rna. icmb.utexas.edu/ 

c) Wool. l.G. (1985). "Studies of the Structure of Eukaryotic (Mammalian) Ribosomes" in Structure, Function, and Genetics of Ribosomes , supra, 
pp. 391-411. 

[0291] Table 16 identifies the residues in the H. marismortui SOS ribosomal subunit that together define at least a 
portion of a virginiamycin binding pocket. As demonstrated in Table 16. all of the residues in that portion of the virgin- 
iamycin binding pocket are conserved between H. marismortui and E. coli, between H. marismortui and rat. between 
E. coli and rat. and between eubacteria and eukaryota : as determined using the comparison method described previ- 
ously with respect to Table 5. 



Table 16 



55 



Residues that Define the Virginiamycin Binding Pocket 


H. marismortui 
Residues 


Residues Not 
Conserved 
between H. 
marismortui and E. 
coil 


Residues Not 
Conserved 
Between H. 
marismortui and 
Rattus 


Residues Not 
Conserved 
Between E. coli 
and Rattus (Coli/ 
Rat) 


Residue 
Conserved 
Between Bacteria 
and Eukaryotes? 












A2100 








Yes 


G2102 








Yes 


A2103 








Yes 


C2104 








Yes 


C2105 








Yes 


G2482 








Yes 


A2486 








Yes 


C2487 








Yes 


U2535 








Yes 


C2536 








Yes 


A2538 








Yes 


U2539 








Yes 
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Tabic 16 (continued) 





Residues that Define the Virginiamycin Binding Pocket 


5 


H. marismortui 
R sidues 


Residues N t 
Conserved 
between H. 
marismortui and E. 
coil 


Residues Not 
Cons rv d 
Between H. 
marismortui and 
Rattus 


Residues Not 
Conserved 
Between E. coli 
and Rattus (Coli/ 
Rat) 


Residue 
C nserved 
Between Bacteria 
and Eukaryotes? 


10 


G2540 








Yes 


U2541 








Yes 




U2620 








Yes 


15 


Residues were determined by a 5.8 angstrom distance measurement between the atoms of virginiamycin and the 
50S ribosome using the program SPOCK 3 , Conserved residues were determined by comparison between the 
proposed secondary structures of H. marismortui, E. coli, and Rattus norvegicusF : . 



a) Christopher. Jon A. (1998) SPOCK: The Structural Properties Observation and Calculation Kit (Program Manual). The Center tor Macromolecular 
Design, Texas ASM University. College Station, TX. 

b) Comparative RNA Web Site URL http://www.rna.icmb.utexas.edu/ 

c) Wool, I G (1985) "Studies ot the Structure of Eukaryotic (Mammalian) Ribosomes" in Structure. Function, and Genetics of Ribosomes , supm, 
pp. 391-411. 



[0292] Table 17 identifies the residues in the H. marismortui SOS ribosomal subunit that together define at least a 
portion of a spiramycin binding pocket. In addition. Table 17 identifies which of those residues that define at least a 
portion of the spiramycin binding pocket are not conserved between H. marismortui and E. coli, those that are not 
conserved between H. marismortui and rat, those that are not conserved between E. coli and rat and those that are 
not conserved between eubacteria and eukaryota. The non-conserved residues were identified as described previously 
with respect to Table 5. 



30 






Table 17 








Residues that Define the Spiramycin Binding Pocket 


35 


H. marismortui 
Residues 


Residues Not 
Conserved 
between H. 
marismortui and E. 
coil 


Residues Not 
Conserved 
Between H. 
marismortui and 
Rattus 


Residues Not 
Conserved 
Between E. coil 
and Rattus (Coli/ 
Rat) 


Residue 
Conserved 
Between Bacteria 
and Eukaryotes? 














40 


C839 


U 


G 


U/G 


No 


C2098 


G 


A 


G/A 


No 




G2099 


A 




A/G 


No 




A2100 








Yes 


•45 


G2102 








Yes 




A2103 








Yes 




A2538 








Yes 


50 


U2539 








Yes 


G2540 








Yes 




U2541 








Yes 




C2644 


U 


U 




Yes 


55 


G2646 


C 


U 


C/U 


No 
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Tabic 17 (continued) 



Residues that Define the Spiramycin Binding Pocket 


H. marismortui 


Residues Not 


Residues Not 


Residues Not 


Residue 


Residues 


Conserved 


Conserved 


Cons rved 


Conserved 




between H. 


Between H. 


Between E. coil 


Between Bacteria 




marismortui and E. 


marismortui and 


and Rattus (Coli/ 


and Eukaryotes? 




coil 


Rattus 


Rat) 




Residues were determined by a 5.8 angstrom distance measurement between the atoms of spiramycin and the SOS 


ribosome using the program SPOCK 3 Conserved residues were determined by comparison between the proposed 


secondary structures of H. marismortui E. coli. and Rattus norvegicu&. 





10 



15 



a) Christopher, Jon A. (1998) SPOCK: The Structural Properties Observation and Calculation Kit (Program Manual). The Center for Macromolecular 
Design, Texas A&M University, College Station, TX. 

b) Comparative RNA Web Site URL:htlp://www.ma.icmb.utexas.edu/ 

c) Wool, I.G. (1985). "Studies of the Structure of Eukaryotic (Mammalian) Ribosomes" in Structure, Function, and Genetics of Ribosomes , supra. 
pp. 391-411. 



20 



25 



30 



35 



40 



50 



55 



[0293] The skilled artisan, when in possession of the foregoing or other exemplary target sites, may use the process 
of rational drug design to identify molecules that potentially bind to one or more of the target sites and/or inhibit ribosoma! 
activity. Furthermore, by taking into account which of the residues that define the target site are conserved between 
pathogens but not conserved between host species, the skilled artisan can design new species-specific protein syn- 
thesis inhibitors. It is apparent that the skilled artisan can take advantage of the regions that are not conserved between 
E. coli and rat to provide target regions for rational drug design By way of example. Figure 29 shows certain regions 
of the polypeptide exit tunnel that are conserved between E. coli and rat (denoted in red) and regions of the polypeptide 
exit tunnel that are not conserved between E. coli and rat (denoted in blue). Figures 29(A) and 29(B) provide enlarged 
views of a large ribosomal subunit when cut in half along the polypeptide exit tunnel. Figure 29(C) is provided to orient 
the reader to the view in Figure 29(A) relative to the large ribosomal subunit. Figure 29(D) is provided to orient the 
reader to the view in Figure 29B relative to the large ribosomal subunit. In addition, the skilled artisan when in possession 
of mutations that prevent or reduce antibiotic activity {i.e.. are related lo antibiotic resistance) can use this information 
to model the relevant antibiotic binding product which can then be used as a basis for rational drug design to identify 
small molecules that overcome drug resistance. It is contemplated that a variety of computer modeling procedures, 
for example, homology modeling protocols, can be used to provide a model of a drug resistance target site by imple- 
menting site directed mutagenesis of nucleotides and/or amino acids and then using the appropriate energy minimi- 
zation and refinement protocols. 

c. identification of Candidate Molecules. 



[0294] It is contemplated that candidate molecules that inhibit protein biosynthesis can be designed entirely de novo 
or may be based upon a pre-existing protein biosynthesis inhibitor. Either of these approaches can be facilitated by 
computationally screening databases and libraries of small molecules for chemical entities, agents, iigands. or com- 
pounds that can bind in whole, or in part, to ribosomes and ribosomal subunits. more preferably to large ribosomal 
subunits. and even more preferably to SOS ribosomal subunits In this screening, the quality of fit of such entities or 
compounds to the binding site or sites may be judged either by shape complementarity or by estimated interaction 
energy (Meng etal. (1992) J. Coma. Chem. 13: 505-524). 

[0295] The design of molecules that bind to or inhibit the functional activity of ribosomes or ribosomal subunits ac- 
cording to this invention generally involves consideration of two factors. First, the molecule must be capable of physically 
and structurally associating with the large ribosomal subunit. Non-covalent molecular interactions important in the 
association of ribosomes and ribosomal subunits with the molecule, include hydrogen bonding, van der Waals and 
hydrophobic interactions. Second, the molecule must be able to assume a conformation that allows it to associate with 
the ribosomes or ribosomal subunits. more preferably with the large ribosomal subunits. and even more preferably 
with the SOS ribosomal subunit. Although certain portions of the molecule may not directly participate in this association 
with a ribosome or ribosomal subunits those portions may still influence the overall conformation of the molecule. This, 
in turn, may have a significant impact on binding affinities, therapeutic efficacy, drug-like qualities, and potency. Such 
conformational requirements include the overall three-dimensional structure and orientation of the chemical entity or 
molecule in relation to all or a portion of the active site or other region of the ribosomes or ribosomal subunits. or the 
spacing between functional groups of a molecule comprising several chemical entities that directly interact with the 
ribosomes or ribosomal subunits. more preferably with the large ribosomal subunits. and even more preferably with 
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the SOS ribosomal subunit 

[0296] The potential, predicted, inhibitory or binding effect of a molecule on ribosomes and ribosomal subunits may 
be analyzed prior to its actual synthesis and testing by the use of computer modeling techniques. If the theoretical 
structure of the given molecule suggests insufficient interaction and association between it and ribosomes or ribosomal 

5 subunits. synthesis and testing of the molecule is obviated. However, if computer modeling indicates a strong interac- 
tion, the molecule may then be synthesized and tested for its ability to interact with the ribosomes or ribosomal subunits 
and inhibit protein synthesis. In this manner, synthesis of inoperative molecules may be avoided. In somecases. inactive 
molecules are synthesized predicted on modeling and then tested to develop a SAR (structure-activity relationship) 
for molecules interacting with a specific region of the ribosome or ribosomal subunit, more preferably of the large 

w ribosomal subunit. and even more preferably of the SOS ribosomal subunit. As used herein, the term "SAR", shall 
collectively refer to the structure-activity/structure property relationships pertaining to the relationship(s) between a 
compound's activity/properties and its chemical structure. 

d. De Novo Design. 

15 

[0297] One skilled in the art may use one of several methods to identify chemical moieties or entities, compounds, 
or other agents for their ability to associate with a preselected target site within a ribosomes or ribosomal subunit. This 
process may begin by visual inspection or computer assisted modeling of. for example, the target site on the computer 
screen based on the atomic co-ordinales of the SOS ribosomal subunil and/or its complexes with other analogues and 

20 antibiotics, deposited in the RCSB Protein Data Bank with accession numbers PDB ID: 1FFK, 1JJ2 ; 1FFZ. oi 1FG0 t 
and/or listed in a table contained on Disk No. 1, 2 or 3 of 3. In one embodiment, compound design uses computer 
modeling programs which calculate how different molecules interact with the various sites of the ribosome ribosomal 
subunit, or a fragment thereof. Selected chemical moieties or entities, compounds., or agents may then be positioned 
in a variety of orientations, or dockcd ; within at least a portion of the target site of a ribosome or ribosomal subunit. 

25 more preferably of a large ribosomal subunit and even more preferably of a SOS ribosomal subunit. Databases of 
chemical structures are available from, for example. Cambridge Crystallographic Data Center (Cambridge, U.K.) and 
Chemical Abstracts Service (Columbus. OH). Docking may be accomplished using software such as Quanta and Sybyl ; 
followed by energy minimization and molecular dynamics with standard molecular mechanics forcefields. such as 
CHARMM and AMBER 

30 [0298] Specialized computer programs may also assist in the process of selecting chemical entities. These include, 
but are not limited to: 

(1) GRID (Goodford. P. J.. "A Computational Procedure for Determining Energetically Favorable Binding Sites on 
Biologically Important Macromolecules" (19B5) J. Med. Chem. 28. 849-857). Software such as GRID, a program 

35 that determines probable interaction sites between probes with various functional group characteristics and the 

macromolecular surface, can be used to analyze the surface sites to determine structures of similar inhibiting 
proteins or molecules. The GRID calculations, with suitable inhibiting groups on molecules {e.g.. protonated primary 
amines) as the probe, are used to identify potential hotspots around accessible positions at suitable energy contour 
levels. GRID is available from Oxford University. Oxford, UK. 

40 

(2) MCSS (Miranker. A. and M. Karplus (1991 ) "Functionality Maps of Binding Sites: A Multiple Copy Simultaneous 
Search Method." Proteins: Structure, Function and Genetics 11: 29-34). MCSS is available from Molecular Simu- 
lations. Burlington, MA. 

•*5 (3) AUTODOCK (Goodsell. D. S. and A. J. Olsen (1 990) "Automated Docking of Substrates to Proteins by Simulated 

Annealing" Proteins: Structure, Function, and Genetics 8: 195-202). AUTODOCK is available from Scripps Re- 
search Institute. La Jolla. CA. 

(4) DOCK (Kuntz, I. D. et at. (1982) "A Geometric Approach to Macromolecule-Ligand Interactions" J. Mot. Biol. 
50 161. 269-288). The program DOCK may be used to analyze an active site or ligand binding site and suggest ligands 

with complementary steric properties. DOCK is available from University of California. San Francisco. CA. 

(5) ALADDIN (Van Drie et at. (1989) "ALADDIN: An Integrated Tool of Computer Assisted Molecular Design and 
Pharmacophore Recognition From Geometric, Steric and Substructure Searching of Three-Dimensional Struc- 

55 tures" J. Comp- Aided Moi Des. 3: 225). 

(6) CLIX (Davie and Lawrence (1992) "CLIX: A Search Algorithm for Funding Novel Ligands Capable of Binding 
Proteins of Known Three-Dimensional Structure" Proteins 12: 31-41). 
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(7) GROUPBUILD (Rotstetn and Murcko (1 993) "GroupBuild: A Fragment-Based Method for De Novo Drug Design" 
J. Med. Chem 36: 1700). 

(8) GROW (Moon and Howe (1991) "Computer Design of Bioactive Molecules: A Method for Receptor-Based De 
5 Novo Ligand Design" Proteins 11 : 314). 

[0299] Once suitable chemical moieties or entities, compounds, or agents have been selected, they can be assem- 
bled into a single molecule. Assembly may proceed by visual inspection and/or computer modeling and computational 
analysis of the spatial relationship of the chemical moieties or entities, compounds or agents with respect to one another 
io in three-dimensional space. This could then be followed by model building using software such as Quanta or Sybyl. 
[0300] Useful programs to aid one of skill in the art in connecting the individual chemical entities, compounds, or 
agents include but are not limited to: 

(1) CAVEAT (Bartlett, P. A. et al, (1989) "CAVEAT: A Program to Facilitate the Structure-Derived Design of Biolog- 
75 ically Active Molecules". In molecular Recognition in Chemical and Biological Problems", Special Pub.. Royal 

Chem. Soc. 78: 82-196) and (Bacon et ai (1992) J. Mot. Biol. 225: 849-858). CAVEAT uses databases of cyclic 
compounds which can act as "spacers" to connect any number of chemical fragments already positioned in the 
active site. This allows one skilled in the art io quickly generate hundreds of possible ways to connect the fragments 
already known or suspected to be necessary for lighl binding. 
20 CAVEAT is available from the University of California. Berkeley. CA. 

(2) 3D Database systems such as MACCS-3D (MDL Information Systems, San Leandro. (CA). This area is re- 
viewed in Martin. Y. C. ; (1992) "3D Database Searching in Drug Design", J. Med. Chem. 35: 2145-2154. 

25 (3) HOOK (available from Molecular Simulations. Burlington. MA.). 

[0301] Instead of proceeding to build a molecule of interest in a step-wise fashion one chemical entity at a time as 
described above, the molecule of interest may be designed as a whole using either an empty active site or optionally 
including some portion or portions of a known inhibitor or inhibitors. Software that implements these methods include: 

30 

(1) LUDI (Bohm. H.-J. (1992) "The Computer Program LUDI: A New Method for the De Novo Design of Enzyme 
Inhibitors". J. ComR. Aid. Molec. Design 6: 61-78) The program LUDI can determine a list of interaction sites into 
which to place both hydrogen bonding and hydrophobic fragments. LUDI then uses a library of approximately 600 
linkers to connect up to four different interaction sites into fragments. Then smaller "bridging" groups such as -CH 2 - 

35 and -COO- are used to connect these fragments. For example, for the enzyme DHFR. the placements of key 

functional groups in the well-known inhibitor methotrexate were reproduced by LUDI. See also, Rotstein and 
Murcko. (1992) J. Med. Chem. 36:1700-1710. LUDI is available from Biosym Technologies, San Diego. CA. 

(2) LEGEND (Nishibata. Y. and A. Itai (1991) Tetrahedron 47, 8985). LEGEND is available from Molecular Simu- 
lations, Burlington, MA. 

(3) LeapFrog (available from Tripos Associates. St. Louis. MO.). 

(4) Aladdin (available from Daylight Chemical Information Systems. Irvine. CA) 

45 

[0302] Other molecular modeling techniques may also be employed in accordance with this invention. See, e.g., 
Cohen. N. C. et al. (1990) "Molecular Modeling Software and Methods for Medicinal Chemistry. J. Med. Chem. 33: 
883-894. See also, Navia. M. A. and M A. Murcko (1992) "The Use of Structural Information in Drug Design". Current 
Opinions in Structural Biology 2: 202-210; and Jorgensen (1 998) "BOSS- Biochemical and Organic Simulation 
50 System" in the Encyclopedia of Computational Chemistry (P.V.R. Schleyer, ed.) Wiley & Sonstra., Athens. U.S.A. 5: 
3281 -3285). 

[0303] tt is contemplated that during modeling, it may be possible to introduce into the molecule of interest, chemical 
moieties thai may be beneficial for a molecule that is to be administered as a pharmaceutical. For example, it may be 
possible to introduce into or omit from the molecule of interest, chemical moieties that may not directly affect binding 
55 of the molecule to the target area but which contribute, for example, to the overall solubility of the molecule in a phar- 
maceutical^ acceptable carrier, the bioavailability of the molecule and/or the toxicity of the molecule. 
Considerations and methods for optimi7ing the pharmacology of the molecules of interest can be found, for example, 
in "Goodman and Gilman's The Pharmacological Basis of Therapeutics" Eighth Edition (Goodman Gilman, Rail, Nies. 
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& Taylor (eds.)). Pergaman Press (1985); Jorgensen & Duffy (2000) Bioorg. Med. Chem. Lett. 10: 1155-1158. 
[0304] Furthermore, the computer program "Oik Prop" can be used to provide rapid predictions for physically signif- 
icant descriptions and pharmaceutically-relevant properties of an organic molecule of interest. A 'Rule of Five'probabiliiy 
scheme can be used to estimate oral absorption of the newly synthesized compounds (Lipinski etai (1997) Adv. Drug 
5 Detiv. Rev. 23:3). 

[0305] Programs suitable for pharmacophore selection and design include: 

(1) DISCO (Abbot Laboratories. Abbot Park. III.). 

10 (2) Catalyst (Bio-CAD Corp., Mountain View ; CA). 

(3) Chem DBS-3D (Chemical Design Ltd., Oxford. U.K.). 

[0306] Furthermore, the skilled artisan may use the information available on how to design suitable therapeutically 
*5 active and pharmaceutical^ useful compounds, and use this information in the design of new protein synthesis inhibitors 
of the invention. See. for example. Lipinski ef at. (1997) Ad. Drug Detiv. Reviews 23: 3-25: Van de Waterbeemd et at. 
(1996) Quantitative Structure-Activity Relationships 15: 480-490; and Cruciani ef at (2000). Theochem-J. Mot. Struct. 
503: 17-30. 

[0307] The entry of the co-ordinales of the ribosome's or ribosomal subunit's proteins and RNAs into the computer 
20 programs discussed above results in the calculation of most probable structure of the macromolecule ; including overall 
atomic co-ordinates of a ribosome, ribosomal subunit or a fragment thereof. These structures can be combined and 
refined by additional calculations using such programs to determine the probable or actual three-dimensional structure 
of the ribosome, ribosomal subunit or a fragment thereof, including potential or actual active or binding sites of ligands. 

25 e . Modification of Existing Molecules. 

[0308] Instead of designing molecules of interest entirely de novo it is contemplated that pre-existing molecules or 
proteins thereof may be used as a starting point for the design of a new candidate. It is contemplated that many of the 
approaches useful for designing molecules de novo may also be useful for modifying existing molecules. 

30 [0309] It is contemplated that knowledge of the spatial relationship between a protein biosynthesis inhibitor, for ex- 
ample, an antibiotic, and its respective binding site within a ribosome permits the design of modified inhibitors that may 
have better binding properties, for example, higher binding affinity and/or specificity, relative to the molecule from which 
it was derived. Alternatively, knowledge of inhibitor contact sites within a ribosome permits the synthesis of a new 
molecule that contain, for example, a portion of a first molecule that binds to the contact site and another portion that 

35 contributes additional functionality. 

[0310] It is contemplated that a variety of modified molecules (for example, modified antibiotics) may be designed 
using the atomic co-ordinates provided herein. For example, it is contemplated that by knowing the spatial relationship 
of one or more of antibiotics relative to the large ribosomal subunit it is possible to generate new antibiotic-based 
molecules. The atomic co-ordinates of each antibiotic relative to the large ribosomal subunit provides information on 

•to what portions of the ribosome or ribosomal subunit and the antibiotic contact one another. Accordingly, from this infor- 
mation the skilled artisan may not only identify contact locations within the ribosome that can be used for de novo drug 
design, as discussed above, but also may identify portions of an antibiotic that can act as a ribosome binding domain. 
[031 1 ] Based on the information provided herein, the skilled artisan may readily identify and produce hybrid antibiotics 
that comprise a ribosome binding domain of a first antibiotic and a ribosome binding domain of a second, different 

•ts antibiotic. The resulting hybrid antibiotics preferably bind to each of respective contact locations within the ribosomal 
subunit simultaneously. The atomic co-ordinates provided herein permit the skilled artisan to identify candidate antibi- 
otics thai may be used as templates in the synthesis of a hybrid, and also provide sleric information necessary lo 
produce linking chemistries such that each ribosome binding domain is properly orientated relative to its respective 
contact site. As a result, it is contemplated that the skilled artisan may produce a hybrid antibiotic that binds to a 

50 ribosome or ribosomal subunit with a higher affinity and/or have higher protein synthesis inhibitory activity than either 
of the individual template antibiotics used to generate the hybrid. Alternatively, the hybrid antibiotic may overcome 
resistance phenotypes that may have developed against cither of the template antibiotics. For example, the proximity 
of the site occupied by the disaccharide moiety of carbomycin to the site filled by anisomycin suggests that a hybrid 
compound including portions of both carbomycin and anisomycin may be an effective inhibitor of protein synthesis. 

55 [0312] Furthermore, the atomic co-ordinates provided herein permit the skilled artisan to use the information per- 
taining to identify a ribosome binding domain and to design other types of protein synthesis inhibitors. For example, 
with an understanding of the ribosome contact region and the surrounding environment, the skilled artisan can provide 
novel molecules, a portion of which is based upon the antibiotic binding region (binding domain) and another portion 
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of which (effector domain) can be designed as a novel space filling domain that sterically inhibits or disrupts protein 
biosynthesis within the ribosome or secretion through the polypeptide exit tunnel. For example, the skilled artisan may 
combine the ribosome binding region of the antibiotic, tylosin. which binds to one side of the polypeptide exit tunnel 
close to the peptidyl transferase site, with a novel chemical moiety bulky enough to block the polypeptide exit tunnel. 
5 However it is contemplated tnat the skilled artisan may take advantage of one or more of the many of the antibiotic 
contact regions disclosed herein to design entirely new binding and effector domains. 

[0313] Furthermore, the present invention permits the skilled artisan to design molecules, for example, selective 
protein synthesis inhibitors that are tailored to be more potent with respect to ribosomes of a target organism, for 
example, a pathogen such a microbe, and less potent, i.e., less toxic, to ribosomes of a non target organism, for 

10 example, host organism such as a human. Also, the invention permits the skilled artisan to use the atomic co-ordinates 
and structures of the large ribosomal subunit and its complexes with protein synthesis inhibitors to design modifications 
to starting compounds, such as an antibiotic, that will bind more tightly to a target ribosome (e.g., the SOS ribosomal 
subunit of bacteria) and less tightly to a non-targeted ribosome (e.g.. human 60S ribosomal subunit). 
[0314] The structure of a complex between the large ribosomal subunit and the starting compound [e.g., tylosin or 

15 another protein synthesis inhibitor) can also be used to guide the modification of that compound to produce new com- 
pounds that have other desirable properties for the applicable industrial and other uses {e.g., as pharmaceuticals, 
herbicides or insecticides), such as chemical stability, solubility or membrane permeability. 

[0315] A variety of antibiotics bind the large ribosomal subunit and disrupt protein synthesis and include members 
of antibiotic families which include, for example, chloramphenicols, macrolides. lincosamides. slreplogramins. althio- 

20 mycins. oxazotinones. nucleotide analogs, thiostreptons. peptides, glutarimides. and trichothecenes. 

[0316] MemDers of the chloramphenicol family include, for example. Chloramphenicol and lodoamphenicol. Mem- 
bers of themacrolide family include, for example. Biaxin (Clarithromycin). Zithromax (Azithromycins), Ketek (Telithro- 
mycin; ketolide). ABT-773. Tylosin. Spiramycin I. Spiramycin II, Spiramycin III, Erythromycin A. Carbomycin A. Telithro- 
mycin, Mcthymycin. Narbomycin. Lankamycin. Oleandomycin. Mcgalomycin. Chalcomycin. Niddamycin. Lcucomycin. 

25 Angolamycin. and Relomycin Members of the licosamide family include, for example. Clindamycin and Lincomycin. 
Members of the streptogramin family include, for example. Streptogramin A. Streptogramin B. Ostreogrycin G. Syner- 
cid. Virginamycin S1. Virginamycin S2 Virginamycin S3 : Virginamycin S4. Vernamycin B. Vernamycin C. Patricin A. 
and Patricin B. A member of the althiomycin family, includes, for example. Althiomycin. A member of the oxazolidine 
family, includes, for example, Linezolid Members of thefamily of nucleotide analogs include, for example. Sparsomycin. 

30 Puromycin. Anisomycin. and Blasticidin S. Members of the thiostrepton family include, for example. Thiostrepton. Si- 
omycin. Sporangiomycin. andThiopeptin. Members of the peptide family include, for example. Viomycin. Capreomycm 
IA. Capreomycin IB. Capreomycm MA. and Capreomycin MB Members of the glutarimide family include, for example. 
Cycloheximide. Streptovitacins. Streptimidone. Inactone. Actiphenol. Members of the trichothecene family include, for 
example. Trichodermin. Trichodermol. Trichodermone. Vomitoxin. T-2 toxin. Trichothecin. Nivalenol. and Verrucarin A. 

35 [0317] Inhibitors can be diffused into or soaked with the stabilized crystals of the large ribosomal subunit as described 
in Example 3 to form a complex with the large ribosomal subunit for collecting X-ray diffraction data. Alternatively, the 
inhibitors can be co-crystallized with the large ribosomal subunit by mixing the inhibitor with the large ribosomal subunit 
before precipitation with high salt. 

[0318] Starting with the structure of the ribosome from H. marismortui, the structure of the ribosome from a non- 
-to targeted organism (for example, the human 60S ribosomal subunit) can be constructed by homology modeling i.e., 
by changing the structure of residues at a target site of interest for the residues at the same positions in of the non- 
target ribosome. This is done computationally by removing the side chains from the ribosome of known structure and 
reolacmg them by the side chains of the unknown structure put in sterically plausible positions. In this way, it can be 
understood how the shapes of tne target sites within the targeted and non-targeted ribosomes differ. This process. 
•*s therefore, provides information concerning how a molecule that binds the target site can be chemically altered in order 
to produce molecules that will bind tightly and specifically to the targeted ribosome but will simultaneously be prevented 
from binding to the non-largeled ribosome. Likewise knowledge of portions of the bound molecules that face the 
solvent permit introduction of other functional groups for additional pharmaceutical purposes. The process of homology 
structure modeling can also be used to understand the mecnanisms whereby mutant ribosomes become resistant to 
so the effects of pharmaceuticals or pesticides, such as herbicides or insecticides. Furthermore, with knowledge of the 
portions of the ribosomal subunit that participates in drug resistance, the skilled artisan may design new molecules 
that overcome the problem of drug resistance. 

[0319] The use of homology structure modeling to design molecules that bind more tightly to the target ribosome 
than to the non-target ribosome has wide-spread applicability. The methods outlined herein can be used to control any 
55 targeted organism, for example, a pathogen, by designing molecules that inhibit large ribosomal subunits of the targeted 
organisms while failing to inhibit the SOS or 60S ribosomal subunit of the non-targeted organism, for example a host, 
to the same extent or not at all The molecules identified or prepared by the methods of the present invention can be 
used to control the targeted organisms while causing the non-targeted organism little or no adverse effects. Thus, the 
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molecules identified or developed using the methods of the present invention can be designed so that their adminis- 
tration kills the target organisms or inhibits some aspect of the biological functions of the target organisms while failing 
to have a similar effect on the non-targeted organism. The adverse effects of the agent on the targeted organisms may 
include, but are not limited to. death of the target organism: slowing growth rates: slowing or eliminating passage from 

5 one growth phase to another {e.g., extending the larval growth stage); slowing or eliminating reproduction, decreasing 
or preventing mating, decreasing or eliminating offspring production, limiting or eliminating target organism weight 
gains: decreasing or eliminating feeding ability and behaviors; and disrupting cellular, tissue and/or organ functions. 
[0320] The novel agents contemplated by the present invention can be useful as herbicides, pesticides (e.g . insec- 
ticides, nematocides. rodenticides. etc.). miticides. or antimicrobial agents (e.g., antifungals, antibacterials. antiproto- 

10 zoals, etc.) to target specific organisms. For example, the novel agents can target animal and plant parasitic nematodes, 
prokaryotic organisms (disease causing microbes), and eukaryotic multicellular pests. Specific examples of multicel- 
lular pests include, but are not limited to. insects, fungi, bacteria, nematodes, mites and ticks, protozoan pathogens, 
animal-parasitic liver flukes, and the like. 

[0321] Herbicides, pesticides, miticides. and antimicrobial agents that inhibit protein synthesis by interacting with 
is ribosomes are known to the skilled artisan. A tew examples are discussed below. These known agents can be modified 
to obtain novel agents by using computer modeling techniques and knowledge of the structure of ribosomes and ri- 
bosomal subunits and the structure of ribosome/agent and hbosomal subunit/agent complexes. 
[0322] The ketolide ABT-773 binds ribosomes tighter than erythromycin in S. pneumoniae and is able to defeat 
macrolide resistance in bacteria (Capobianco etai. (2000) Antimicrob Agents Chemother 44(6). 1562-1567). The tools 
20 and methodologies of the present invention can be used to obtain erythromycin derivatives that bind the ribosomes or 
ribosomal subunits of target bacteria more tightly than they bind the ribosomes and ribosomal subunits of non-target 
animals. The target bacteria can be any infectious bacteria, particularly S. pneumoniae, and even more particularly 
erythromycin -resistant S. pneumoniae. The non-target animals can be any animal, particularly mammals, and even 
more particularly humans. 

25 [0323] Examples of antibiotics that are inhibitors of protein synthesis include, but are not limited to, puromycin. cy- 
cloheximide. chloramphenicol, tetracycline, and streptomycin (Heldt, (1996) Plant Biochemistry and Molecular Biology 
21.2: 458-464). Puromycin. as discussed earlier, binds as an analogue of an aminoacyl-tRNA to the A-site and is added 
to nascent peptide chains, its weak associate with the ribosome prevents further elongation steps in prokaryotes and 
eukaryotes. Cycloheximide inhibits peptidyl transferase in eukaryotic ribosomes. 

30 Chloramphenicol inhibits peptidyl transferase in prokaryotic ribosomes. Tetracycline binds to the 30S subunit and in- 
hibits the binding of aminoacyl-tRNA to prokaryotic ribosomes much more than to eukaryotic ones. Streptomycin in- 
teracts with 30S ribosomes which results in an incorrect recognition of mRNA sequences and thus inhibits initiation in 
prokaryotic ribosomes. U.S. Patent No. 5.801,153 discloses antibiotics against pathogens. Aminoglycosides are ex- 
amples of antibacterial antibiotics that appear to inhibit protein synthesis. However, there is a limitation to their use 

35 because of their ototoxic and nephrotoxic properties. Amikacin sulfate, Framycetin sulfate, Gentamycin sulfate. Kan- 
amycin sulfate. Neomycin sulfate, Netilmicin sulfate, Paromomycin sulfate. Sissomycin sulfate, Tobramycin, Vanco- 
mycin hydrochloride, and Viomycin sulfate are the members of the aminoglycoside family. The tools and methodologies 
of the present invention can be used to obtain derivatives of any antibiotic of choice so that they inhibit the protein 
synthesis of target organisms to a greater degree than they inhibit the protein synthesis of non-target organisms, such 

^o as humans. 

[0324] Examples of targeted and non-targeted organisms include, but are not limited to, those provided in Table 1 8. 



Table 18 



Examples of Classes of Molecules which can be Identified and/or Developed by the Methods of the Invention and 

Applicable Target/Non-Target Organisms. 


Type of Molecule 


Target Organisms 


Non-Target Organisms 


Herbicides 


Dicotyledonous plants 


Monocotyledonous plants 


Herbicides 


Grasses 


Soybeans, potatoes, coffee 


Insecticides 


Flics. Mites 


Honey becs 


Pesticides 


Ticks 


Deer 


Pesticides 


Lice 


Birds 


Miticides 


Parasitic miles (mange) 


Dogs 
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Tabic 18 (continued) 



Examples of Classes of Molecules which can be Identified and/or Developed by the Methods of the Invention and 

Applicable Target/Non-Target Organisms. 


Type of Molecule 


Target Organisms 


Non-Target Organisms 


Antimicrobial Agents 
(Antibactcrials) 


Streptococcus pneumoniae 


Humans 


Antimicrobial Agents 
(Antibacterials) 


Clostridium difficile 


Escherichia coli 


Antimicrobial Agents (Antifungals) 


Erysiphe graminis 


Barley 


Antimicrobial Agents 
(Antiprotozoals) 


Toxoplasma gondii 


Animals 


Poisons (Rodentcides) 


Rats 


Dogs. cats, humans 



10 



15 



20 



30 



35 



40 



45 



50 



[0325] It is contemplated that the tools and methodologies of the present invention can be used to obtain inhibitors 
of protein synthesis of target insects, such as bollworms and mosquitoes, more than they inhibit the protein synthesis 
of non-target insects, such as beetles of the family Coccinellidae (e.g., ladybugs) and Apis mellifera {honey bees). 
Other possible target insects include, but are not limited to. insects selected from the orders Coleoptera (beetles). 
Dtptera (flies, mosquitoes). Hymenoptera (wasps, ants, sawflies). Lepidoptera (butterflies and moths). Mattophaga 
(lice). Homoptera (whiteflies. aphids), Hemiptera (bugs), Orthroptera (locusts cockroaches). Thysanoptera (thrips). 
Dermaptera (earwigs). Isoptera, Anoplura, Siphonaptera, and Trichoptera (caddis flies). 

[0326] Furthermore, it is contemplated that the tools and methodologies of the present invention can be used to 
obtain inhibitors of protein synthesis of target plants which inhibit protein synthesis of the target plants more than they 
inhibit the protein synthesis of non-target plants and animals. The target plants can be any unwanted plant species, 
particular weeds, and even more particularly noxious weeds. Whether or not a particular plant is considered a weed 
will depend upon the context in which it is growing. For example, unwanted Zea mays (corn) plants growing in a Glycine 
max (soybean) field could be considered unwanted weeds. Examples of weeds which are likely target plants include, 
but are not limited to. Allium vineale (wild garlic). Bromus tectorum (downy brome). Triticum cylindrtcum (jointed goat- 
grass). Amaranthus spp. (pigsweed). Chenopodium album (lambsquarters). Avena fatua (wild oats). B. secalinus 
(cheat). Echinochloa crus-galli (bamyardgrass). Alopecurus myosuroides (blackgrass). Setaria faberii (giant foxtail). 
Xanthium strumarium (common cocklebur). Ambrosia artemisiifolia (common ragweed), and Ipomoea spp. (morning 
glories). The norWargel organisms can be any plant, particularly any desirable plant, and even more particularly any 
crop plant. The non-target organisms can also be any animals, particularly mammals, and even more particularly 
humans. In one preferred embodiment, the tools and methodologies of the present invention can be used to produce 
protein synthesis inhibitors which kill or injure one or more noxious weed species but fail to harm non-target plants and 
animals. 

[0327] Target bacteria of interest include, but arc not limited to. Staphylococcus aureus, Streptococcus pyogenes, 
Streptococcus agafactiae. Streptococcus bovis, Streptococcus pneumoniae, Moraxella catarrhal, Neisseria gonor- 
rhoeae. Neisseria meningitides. Bacillus anthracis, Corynebacterium diphtheriae, Listeria monocytogenes, Erysip- 
elothrix rhusiopathiae, Clostridium perfringens, Clostridium tetani, Clostridium difficile, Eschericia coli, Proteus mira- 
bilis. Psuedomonas aeruginosa, Klebsiella pneumoniae, Haemophilus intluen7ae, Haemophilus ducreyi. Yersinia pes- 
tis, Yersinia enterocolitica, Francisella tularensis, Pasteurella multocida, Vibrio cholerae, Flavobacterium meningosep- 
ticum, Pseudomonas mallei, Pseudomonas pseudomatlei, Campylobacter jejuni, Campylobacter fetus, Fusobacterium 
nucteatum. Calymmatobacterium granutomatis, Streptobacillus moniliformis. Legionella pneumophila. Mycobacterium 
avium-intracellulare, Mycobacterium tuberculosis, Mycobacterium leprae, Treponema pallidum, Treponema pertenue, 
Borrelia burgdorferi, Borrelia recurrentis. Actinomyces isrealii, Nocardia asteroides, Ureaplasma urealyticum, Myco- 
plasma pneumoniae. Chlamydia psittaci, Chlamydia trachomatis, Chlamydia pnemoniae, Pneumocystis carinii, Coc- 
cidioides immitis, Histoplasma capsulatum, Blastomyces dermatitidis, Paracoccidioides brasiliensis, Sporothrix 
schenckii, Cryptococcus neoformans. 

[0328] Once a candidate molecule has been designed or selected by the above methods, the affinity with which that 
molecule may bind to the ribosome or ribosomal subunit may be tested and optimized by computational evaluation 
and/or by testing biological activity after synthesizing the compound. Candidate molecules may interact with the ribos- 
omes or ribosomal subunits in more than one conformation each of which has a similar overall binding energy. In those 
cases, the deformation energy of binding may be considered to be the difference between the energy of the free 
molecule and the average energy of the conformations observed when the molecule binds to the ribosomes or ribosomal 
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subunits. more preferably to the large ribosomal subunits. and even more preferably to the SOS ribosomal subunits. 
[0329] A molecule designed or selected as binding to a nbosome or ribosomal subunit may be f urther computationally 
optimized so that in its bound state it preferably lacks repulsive electrostatic interaction with the target region. Such 
non-complementary (e.g.. electrostatic) interactions include repulsive charge-charge, dipole-dipole andcharge-dipole 

5 interactions. Specifically, the sum of all electrostatic interactions between the inhibitor and the enzyme when the inhibitor 
is bound to the ribosome or the ribosomal subunit. preferably make a neutral or favorable contribution to the enthalpy 
of binding. Weak binding compounds can also be designed by these methods so as to determine SAR. 
[0330] Specific computer programs that can evaluate a compound deformation energy and electrostatic interaction 
are available in the art. Examples of suitable programs include: Gaussian 92. revision C (M. J. Frisch, Gaussian. Inc.. 

10 Pittsburgh. PA ): AMBER, version 4.0 (P. A. Kollman. University of California at San Francisco. CA): QUANTA'CHARMM 
(Molecular Simulations. Inc.. Burlington. MA); and Insight ll/Discover (Biosysm Technologies Inc.. San Diego, CA). 
These programs may be implemented, for instance, using a Silicon Graphics workstation, IRIS 4D/35 or IBM RISC/ 
6000 workstation model 550. Other hardware systems and software packages are known to those skilled in the art. 
[0331] Once a molecule of interest has been selected or designed, as described above, substitutions may then be 

15 made in some of its atoms or side groups in order to improve or modify its binding properties. Generally, initial substi- 
tutions are conservative, i.e.. the replacement group will approximate the same size, shape, hydrophobicity and charge 
as the original group. It should, of course, be understood that components known in the art to alter conformation should 
be avoided Such substituted chemical compounds may then be analyzed for efficiency of fit to the ribosome or ribos- 
omal subunit by the same computer methods described in detail, above. 

20 [0332] In addition, the actual ribosome-related ligands. complexes or mimetics may be crystallized and analyzed 
using X-ray diffraction. The diffraction pattern co-ordinates are similarly used to calculate the three-dimensional inter- 
action of a ligand and the ribosome. ribosomal subunit. or a mimetic, in order to confirm that the ligand binds to. or 
changes the conformation of. a particular site on the ribosome or ribosomal subunit. or where the mimetic has a similar 
three-dimensional structure to that of a ribosome, ribosomal subunit or a fragment thereof. 

3. Synthesis of Lead Molecules 

[0333] A lead molecule of the present invention can be. but is not limited to, at least one selected from a lipid, nucleic 
acid, peptide, small organic or inorganic molecule, chemical compound, element, saccharide, isotope, carbohydrate. 

30 imaging agent, lipoprotein, glycoprotein, enzyme, analytical probe, and an antibody or fragment thereof, any combi- 
nation of any of the foregoing, and any chemical modification or variant of any of the foregoing. In addition, a lead 
molecule may optionally comprise a detectable label. Such labels include, but are not limited to. enzymatic labels, 
radioisotope or radioactive compounds or elements, fluorescent compounds or metals, chemiluminescent compounds 
and bioluminescent compounds. Well known methods may be used for attaching such a detectable label to a lead 

35 molecule. 

[0334] Methods useful for synthesizing lead molecules such as lipids, nucleic acids, peptides, small organic or inor- 
ganic molecules, chemical compounds, elements, saccharides, isotopes, carbohydrates, imaging agents, lipoproteins, 
glycoproteins, enzymes, analytical probes antibodies, and antibody fragments are well known in the art. Such methods 
include the traditional approach of synthesizing one such lead molecule such as a single defined peptide, at a time, 
•to as well as combined synthesis of multiple lead molecules in a one or more containers. Such multiple lead molecules 
may include one or more variants of a previously identified lead molecule. Methods for combined synthesis of multiple 
lead molecules are particularly useful in preparing combinatorial libraries which may be used in screening techniques 
known in the art. 

[0335] By way of example, it is well known in the art that multiple peptides and oligonucleotides may be simultaneously 
43 synthesized. Lead molecules that are small peptides up to 50 amino acids in length, may be synthesized using standard 
solid-phase peptide synthesis procedures, for example, procedures similar to those described in Merrifield (1963) J. 
Am. Chem. Soc, 85; 2149. For example, during synthesis. N-u-prolecled amino acids having protected side chains 
are added stepwise to a growing polypeptide chain linked by its C-terminal end to an insoluble polymeric support, e. 
g.. polystyrene beads. The peptides are synthesized by linking an amino group of an N-a-deprotected amino acid to 
so an a-carboxy group of an N- a-protected amino acid that has been activated by reacting it with a reagent such as 
dicyclohexylcarbodiimide. The attachment of a free amino group to the activated carboxyl leads to peptide bond for- 
mation. The most commonly used N-a-protccting groups include Boc which is acid labile and Fmoc which is base labile. 
[0336] Briefly, the C-terminal fM -a-protected amino acid is first attached to the polystyrene beads. Then, the N-a- 
protecting group is removed. The deprotected a-amino group is coupled to the activated a-carboxylate group of the 
55 next N-a-protected amino acid. The process is repeated until the desired peptide is synthesized. The resulting peptides 
are cleaved from the insoluble polymer support and the amino acid side chains deprotected. Longer peptides, for 
example greater than about 50 amino acids in length, typically are derived by condensation of protected peptide frag- 
ments. Details of appropriate chemistries, resins, protecting groups, protected amino acids and reagents are well known 
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in the art and so are not discussed in detail herein. See for example, Atherton et al. (1 963) Solid Phase Peptide Syn- 
thesis: A Practical Approach (IRL Press), and Botianszky (1993) Peptide Chemistry. A Pra ctical Textbook. 2nd Ed. 
Springer-Verlag. and Fields et at. (1990) Int. J. Peptide Protein Res. 35:161-214. 

[0337] Purification of the resulting peptide is accomplished using conventional procedures, such as preparative 
5 HPLC. e.g., gel permeation, partition and/or ion exchange chromatography. The choice of appropriate matrices and 
buffers are well known in the art and so are not described in detail herein. 

[0338] It is contemplated that a synthetic peptide in accordance with the invention may comprise naturally occurring 
amino acids, unnatural amino acids, and/or amino acids having specific characteristics, such as. for example, amino 
'acids that are positively charged, negatively charged, hydrophobic, hydrophilic. or aromatic. As used herein, the term 

10 "naturally occurring amino acids" refers to the L-isomers of amino acids normally found in proteins. The predominant 
naturally occurring amino acids are glycine, alanine, valine, leucine, isoleucine. serine, methionine, threonine, pheny- 
lalanine, tyrosine, tryptophan, cysteine, proline, histidine. aspartic acid, asparagine, glutamic acid, glutamine. arginine, 
and lysine. Unless specifically indicated, all amino acids are referred to in this application are in the L-form . Furthermore, 
as used herein, the term "unnatural amino acids" refers to amino acids that are not naturally found in proteins. For 

J5 example, selenomethionine. 

[0339] Amino acids that are "positively charged" include any amino acid having a positively charged side chain under 
normal physiological conditions. Examples of positively charged naturally occurring amino acids include, for example, 
arginine, lysine, and histidine. 

Conversely, amino acids thai are "negatively charged" include any amino acid having a negatively charged side chains 
20 under normal physiological conditions. Examples of negatively charged naturally occurring amino acids include, for 
example, aspartic acid and glutamic acid. 

[0340] As used herein, the term "hydrophobic amino acid" includes any amino acids having an uncharged, nonpolar 
side chain that is relatively insoluble in water. Examples of naturally occurring hydrophobic amino acids include, for 
example, alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine. In addition, as used 
25 herein, the term "hydrophilic amino acid" refers to any ammo acids having an uncharged, polar side chain that is 
relatively soluble in water. Examples of naturally occurring hydrophilic amino acids include, for example, serine, thre- 
onine, tyrosine, asparagine. glutamine and cysteine. 

[0341] Finally, as used herein, the term "aromatic" refers to amino acid residues which side chains have delocalized 
conjugated system. Examples of aromatic residues include, for example, phenylalanine, tryptophan, and tyrosine. 
30 [0342] With regard to the production of non-peptide small organic molecules which act as a ligand in the present 
invention, these molecules can be synthesized using standard organic chemistries well known and thoroughly docu- 
mented in the patent and other literatures 

[0343] Many of the known methods useful in synthesizing lead of the present invention may be automated, or may 
otherwise be practiced on a commercial scale. As such once a lead molecule has been identified as having commercial 
35 potential, mass quantities of that molecule may easily be produced. 

4. Characterization of Molecules 

[0344] Molecules designed, selected and/or optimized by methods described above, once produced, may be char- 
ge acterized using a variety of assays known to those skilled in the art to determine whether the compounds have biological 

activity. For example, the molecules may be characterized by conventional assays, including but not limited to those 

assays described below, to determine whether they have a predicted activity, binding activity and/or binding specificity. 

[0345] Furthermore, high-throughput screening may be used to speed up analysis using such assays. As a result, 

it may be possible to rapidly screen new molecules for their ability to interact with a ribosome or ribosomal subunit 
•fs using the tools and methods of the present invention. General methodologies for performing high-throughput screening 

are described, for example, in Devlin. (1998). High Throughput Screening. Marcel Dekker; and U.S. Patent No. 

5.763.263. High-lhroughpul assays can use one or more different assay techniques including, but not limited to. those 

described below. 

so (1 ) Surface Binding Studies. A variety of binding assays may be useful in screening new molecules for their binding 

activity. One approach includes surface plasmon resonance (SPR) which can be used to evaluate the binding 
properties molecules of interest with respect to a ribosome. ribosomal subunit or a fragment thereof. 

SPR methodologies measure the interaction between two or more macromolecules in real-time through the 
generation of a quantum-mechanical surface plasmon. One device, (BIAcore Biosensor RTM from Pharmacia 

55 Biosensor. Piscatawy. N J.) provides a focused beam of polychromatic light to the interface between a gold film 

(provided as a disposable biosensor "chip") and a buffer compartment that can be regulated by the user. A 100 
nm thick "hydrogel" composed of carboxylated dextran which provides a matrix for the covalent immobilization of 
analytes of interest is attached to the gold film. When the focused light interacts with the free electron cloud of the 
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gold film, plasmon resonance is enhanced. The resulting reflected light is spectrally depleted in wavelengths that 
optimally evolved the resonance. By separating the reflected polychromatic light into its component wavelengths 
(by means of a prism), and determining the frequencies which are depleted, the BIAcore establishes an optical 
interface which accurately reports the behavior of the generated surface plasmon resonance. When designed as 

5 above, the plasmon resonance (and thus the depletion spectrum) is sensitive to mass in the evanescent field 

(which corresponds roughly to the thickness of the hydrogel). If one component of an interacting pair is immobilized 
to the hydrogel. and the interacting partner is provided through the buffer compartment, the interaction between 
the two components can be measured in real time based on the accumulation of mass in the evanescent field and 
its corresponding effects of the plasmon resonance as measured by the depletion spectrum. This system permits 

10 rapid and sensitive real-time measurement of the molecular interactions without the need to label either component. 

(2) Immunodiagnostics and Immunoassays. These are a group of techniques that can be used for the measurement 
of specific biochemical substances, commonly at low concentrations in complex mixtures such as biological fluids, 
that depend upon the specificity and high affinity shown by suitably prepared and selected antibodies for their 

'5 complementary antigens. A substance to be measured must, of necessity, be antigenic - either an immunogenic 

macromolecule or a haptenic small molecule. To each sample a known, limited amount of specilic antibody is 
added and the fraction of the antigen combining with it. often expressed as the bound:free ratio, is estimated, using 
as indicator a form of the antigen labeled with radioisotope (radioimmunoassay), fluorescent molecule (fluoroim- 
munoassay). stable free radical (spin immunoassay), enzyme (enzyme immunoassay), or other readily dislinguish- 

20 able label 

Antibodies can be labeled in various ways, including: enzyme-linked immunosorbent assay (ELISA); radioim- 
muno assay (RIA); fluorescent immunoassay (FIA); chemiluminescent immunoassay (CLIA): and labeling the an- 
tibody with colloidal gold particles (immunogold). 

Common assay formats include the sandwich assay, competitive or competition assay, latex agglutination 
2* assay, homogeneous assay, microtitre plate format and the microparticle-based assay. 

(3) Enzyme-linked immunosorbent assay (ELISA). ELISA is an immunochemical technique that avoids the hazards 
of radiochemicals and the expense of fluorescence detection systems. Instead, the assay uses enzymes as indi- 
cators. ELISA is a form of quantitative immunoassay based on the use of antibodies (or antigens) that are linked 

30 to an insoluble carrier surface, which is then used to "capture" the relevant antigen (or antibody) in the test solution. 

The antigen-antibody complex is then detected by measuring the activity of an appropriate enzyme that had pre- 
viously been covalently attached to the antigen (or antibody). 

General methods and compositions for practicing ELISA are described for example, in Crowther (1995) ELISA - 
Theory and Practice (Methods in Molecular Biology) . Humana Press: Challacombe and Kemeny, (1998) ELISA 
35 and Other Solid Phase Immunoassays - Theoretical and Practical Aspects, John Wiley; Kemeny. (1 991 ) A Practical 

Guide to ELISA . Pergamon Press; Ishikawa. (1991) Ultrasensitive and Rapid Enzyme Immunoassay (Laboratory 
Techniques in Biochemistry and Molecular Biology) Elsevier. 

(4) Colorimetric Assays. Colorimetry is any method of quantitative chemical analysis in which the concentration 
jo or amount of a compound is determined by comparing the color produced by the reaction of a reagent with both 

standard and test amounts of the compound, often using a colorimeter. A colorimeter is a device for measuring 
color intensity or differences in color intensity, either visually or photoelectrically. 

Standard colorimetric assays of beta-galactosidase enzymatic activity are well known to those skilled in the 
art (see. for example. Norton etai (1985) Mol. Cell. Biol. 5: 281-290). A colorimetric assay can be performed on 
•*s whole cell lysates using O-nitrophenyl-D-galactopyranoside (ONPG. Sigma) as the substrate in a standard color- 

imetric beta-galactosidase assay (Sambrook et at. (1989) Molecular Cloning - A Laboratory Manual. Cold Spring 
Harbor Laboratory Press). Automated colorimetric assays are also available lor the detection of - galaclosidase 
activity, as described in U.S. Patent No. 5.733.720. 

so (5) Immunofluorescence Assays. Immunofluorescence or immunofluorescence microscopy is a technique in which 

an antigen or antibody is made fluorescent by conjugation to a fluorescent dye and then allowed to react with the 
complementary antibody or antigen in a tissue section or smear. The location of the antigen or antibody can then 
be determined by observing the fluorescence by microscopy under ultraviolet light. 

A general description of immunofluorescent techniques appears for example, in Knapp et at. (1978) Immun- 

55 ofluorescence and Related Staining Techniques , Elsevier: Allan. (1999) Protein Localization by Fluorescent Mi- 

croscopy - A Practical Approach (The Practical Approach Series) Oxford University Press: Caul, (1993) Immun- 
ofluorescence Antigen Detection Techniques in Diagnostic Microbiology . Cambridge University Press. For detailed 
explanations of immunofluorescent techniques applicable to the present invention, see. for example. U.S. Patent 
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No. 5,912,176: U.S. Patent No. 5.869.264; U.S. Patent No. 5,866.319: and U.S. Patent No. 5.861.259. 

(6) Fluorescence Polarization. Fluorescence polarization (FP) is a measurement technique that can readily be 
applied to protein-protein and protein-ligand interactions in order to derive IC 50 s and Kds of the association reaction 

5 between two molecules. In this technique one of the molecules of interest is conjugated with a fluorophore. This 

is generally the smaller molecule in the system (in this case, the molecule of interest). The sample mixture, con- 
taining both the ligand-probe conjugate and the ribosome. ribosomal subunit or fragment thereof, is excited with 
vertically polarized light. Light is absorbed by the probe fluorophores. and re-emitted a short time later. The degree 
of polarization of the emitted light is measured. Polarization of the emitted light is dependent on several factors, 

w but most importantly on viscosity of the solution and on the apparent molecular weight of the fluorophore. With 

proper controls, changes in the degree of polarization of the emitted light depends only on changes in the apparent 
molecular weight of the fluorophore. which in-turn depends on whether the probe-ligand conjugate is free in solu- 
tion, oris bound to a receptor. Binding assays based on FP have a number of important advantages, including the 
measurement of IC 50 s and Kds under true homogenous equilibrium conditions, speed of analysis and amenity to 

J5 automation, and ability to screen in cloudy suspensions and colored solutions. 

(7) Protein Synthesis. It is contemplated that, in addition to characterization by the foregoing biochemical assays, 
the molecule of interest may also be characterized as a modulator (for example, an inducer of protein synthesis 
or an inhibitor ol protein synthesis) of the functional activity of the ribosome or ribosomal subunit. 

20 

[0346] Inhibitors of protein synthesis may be assayed on the cellular level. For example, molecules of interest can 
be assayed for inhibitory action against organisms, for example, micro-organism, by growing the micro-organism of 
interest in media either containing or lacking the molecule of interest. Growth inhibition may be indicative that the 
molecule may be acting as a protein synthesis inhibitor. 

25 [0347] Furthermore, more specific protein synthesis inhibition assays may be performed by administering the com- 
pound to a whole organism, tissue, organ, organelle, cell, a cellular or subcellular extract, or a purified ribosome prep- 
aration and observing its pharmacological and inhibitory properties by determining, for example, its inhibition constant 
(IC 50 ) for inhibiting protein synthesis. Incorporation of 3 H leucine or 35 S methionine, or similar experiments can be 
performed to investigate protein synthesis activity. 

30 [0348] A change in the amount or the rate of protein synthesis in the cell in the presence of a molecule of interest 
indicates that the molecule is an inducer of protein synthesis. A decrease in the rate or the amount of protein synthesis 
indicates that the molecule is a inhibitor of protein synthesis 

H. Drug Formulation and Administration 

35 

[0349] It is contemplated that once identified, the active molecules of the invention may be incorporated into any 
suitable carrier prior to use. More specifically, the dose of active molecule, mode of administration and use of suitable 
carrier will depend upon the target and non-target organism of interest. 

[0350] It is contemplated that with regard to mammalian recipients, the compounds of interest may be administered 
jo by any conventional approach known and/or used in the art. Thus, as appropriate, administration can be oral or 
parenteral, including intravenous and intraperitoneal routes of administration. In addition, administration can be by 
periodic injections of a bolus, or can be made more continuous by intravenous or intraperitoneal administration from 
a reservoir which is external {e.g.. an intrvenous bag). In certain embodiments, the compounds of the invention can 
be therapeutic-grade. That is. certain embodiments comply with standards of purity and quality control required for 
administration to humans. Veterinary applications are also within the intended meaning as used herein. 
[0351] The formulations, both for veterinary and for human medical use. of the drugs according to the present in- 
vention typically include such drugs in association with a pharmaceulically acceptable carrier therefore and optionally 
other therapeutic ingredient(s). The carrier(s) should be "acceptable" in the sense of being compatible with the other 
ingredients of the formulations and not deleterious to the recipient thereof. Pharmaceutical^ acceptable carriers, in 
so this regard, are intended to include any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, 
isotonic and absorption delaying agents, and the like, compatible with pharmaceutical administration. The use of such 
media and agents for pharmaceulically active substances is known in the art. Except insofar as any conventional media 
or agent is incompatible with the active compound, use thereof in the compositions is contemplated. Supplementary 
active compounds (identified or designed according to the invention and/or known in the art) also can be incorporated 
55 into the compositions. The formulations may conveniently be presented in dosage unit form and may be prepared by 
any of the methods well known in the art of pharmacy/microbiology In general, some formulations are prepared by 
bringing the drug into association with a liquid carrier or a finely divided solid carrier or both, and then, if necessary, 
shaping the product into the desired formulation. 
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[0352] A pharmaceutical composition of the invention should be formulated to be compatible with its intended route 
of administration. Examples of routes of administration* include oral or parenteral, e.g.. intravenous, intradermal, inha- 
lation, transdermal (topical), transmucosal. and rectal administration. Solutions or suspensions used for parenteral, 
intradermal, or subcutaneous application can include the following components: a sterile diluent such as water for 

5 injection, saline solution, fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic solvents: anti- 
bacterial agents such as benzyl alcohol or methyl parabens: antioxidants such as ascorbic acid or sodium bisulfite: 
chelating agents such as ethylenediaminetetraacetic acid: buffers such as acetates, citrates or phospnates and agents 
for the adjustment of tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, such as 
hydrochloric acid or sodium hydroxide. 

w [0353] Useful solutions for oral or parenteral administration can be prepared by any of the methods well known in 
the pharmaceutical art. described, for example, in Remington's Pharmaceutical Sciences. (Gennaro. A., ed.). Mack 
Pub., (1990). Formulations for parenteral administration can also include glycocholate for buccal administration, meth- 
oxysalicylate for rectal administration, or cutric acid for vaginal administration. The parenteral preparation can be en- 
closed in ampoules, disposable syringes or multiple dose vials made of glass or plastic. Suppositories for rectal ad- 

'5 ministration also can be prepared by mixing the drug with a nonirritating excipient such as cocoa butter, other glycendes, 
or other compositions which are solid at room temperature and liquid at body temperatures. Formulations also can 
include, for example, polyalkylene glycols such as polyethylene glycol, oils of vegetable origin, hydrogenated naph- 
thalenes, and the like. Formulations for direct administration can include glycerol and other compositions of high vis- 
cosity. Other potentially useful parenteral carriers for these drugs include ethylene-vinyl acetate copolymer particles. 

20 osmotic pumps, implantable infusion systems, and liposomes. Formulations for inhalation administration can contain 
as excipients, for example, lactose, or can be aqueous solutions containing, for example, polyoxyethylene-9-lauryl 
ether, glycocholate and deoxycholate, or oily solutions for administration in the form of nasal drops, or as a gel to be 
applied intranasally. Retention enemas also can be used for rectal delivery. 

[0354] Formulations of the present invention suitable for oral administration may be in the form of discrete units such 
25 as capsules, gelatin capsules, sachets, tablets, troches, or lozenges, each containing a predetermined amount of the 
drug: in the form of a powder or granules: in the form of a solution or a suspension in an aqueous liquid or non-aqueous 
liquid; or in the form of an otl-in-water emulsion or a water-in-oil emulsion. The drug may also be administered in the 
form of a bolus, electuary or paste. A tablet may be made by compressing or moulding the drug optionally with one or 
more accessory ingredients. Compressed tablets may be prepared by compressing, in a suitable machine, the drug 
30 in a free-flowing form such as a powder or granules, optionally mixed by a binder, lubricant, inert diluent, surface active 
or dispersing agent. Moulded tablets may be made by moulding, in a suitable machine, a mixture of the powdered drug 
and suitable carrier moistened with an inert liquid diluent. 

[0355] Oral compositions generally include an inert diluent or an edible carrier. For the purpose of oral therapeutic 
administration, the active compound can be incorporated with excipients. Oral compositions prepared using a fluid 

35 carrier for use as a mouthwash include the compound in the fluid carrier and are applied orally and swished and 
expectorated or swallowed. Pharmaceutically compatible binding agents, and/or adjuvant materials can be included 
as part of the composition. The tablets, pills, capsules, troches and the like can contain any of the following ingredients, 
or compounds of a similar nature: a binder such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient 
such as starch or lactose; a disintegrating agent such as alginic acid. Primogel, or corn starch: a lubricant such as 

^o magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide: a sweetening agent such as sucrose or 
saccharin: or a flavoring agent such as peppermint, methyl salicylate, or orange flavoring. 

[0356] Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions (where water sol- 
uble) or dispersions and sterile powders forthe extemporaneous preparation of sterile injectable solutions or dispersion. 
For intravenous administration, suitable carriers include physiological saline, bacteriostatic water, Cremophor ELTM 

•*5 (BASF. Parsippany. NJ) or phosphate buffered saline (PBS). In all cases, the composition should be sterile and should 
be fluid to the extent that easy syringability exists. It should be stable under the conditions of manufacture and storage 
and should be preserved against the contaminating action of microorganisms such as bacteria and fungi. The carrier 
can be a solvent or dispersion medium containing, for example, water, ethanol. polyol (for example, glycerol propylene 
glycol, and liquid polyetheytene glycol, and the like), and suitable mixtures thereof. The proper fluidity can be main- 

so tained. for example, by the use of a coating such as lecithin, by the maintenance of the required particle size in the 
case of dispersion and by the use of surfactants. Prevention of the action of microorganisms can be achieved by various 
antibacterial and antifungal agents, for example, parabens, chlorobutanol. phenol, ascorbic acid, thimcrosal, and the 
like. In many cases, it will be preferable to include isotonic agents, for example, sugars, polyalcohols such as manitol, 
sorbitol, sodium chloride in the composition. Prolonged absorption of the injectable compositions can be brought about 

55 by including in the composition an agent which delays absorption, for example, aluminum monostearate and gelatin. 
[0357] Sterile injectable solutions can be prepared by incorporating the active compound in the required amount in 
an appropriate solvent with one or a combination of ingredients enumerated above, as required, followed by filtered 
sterilization. Generally, dispersions are prepared by incorporating the active compound into a sterile vehicle which 
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contains a basic dispersion medium and the required other ingredients from those enumerated above. In the case of 
sterile powders for the preparation of sterile injectable solutions, methods of preparation include vacuum drying and 
freeze-drying which yields a powder of the active ingredient plus any additional desired ingredient from a previously 
sterile-filtered solution thereof. 

5 [0358] Formulations suitable for intra-articular administration may be in the form of a sterile aqueous preparation of 
the drug which may be in microcrystalline form, for example, in the form of an aqueous microcrystalline suspension. 
Liposomal formulations or biodegradable polymer systems may also be used to present the drug for both intra-articular 
and ophthalmic administration. 

[0359] Formulations suitable for topical administration, including eye treatment, include liquid or semi-liquid prepa- 
re rations such as liniments, lotions, gets, applicants, oil-in-water or water-in-oil emulsions such as creams, ointments or 
pasts: or solutions or suspensions such as drops. Formulations for topical administration to the skin surface can be 
prepared by dispersing the drug with a dermatologically acceptable carrier such as a lotion, cream, ointment or soap. 
Particularly useful are carriers capable of forming a film or layer over the skin to localize application and inhibit removal. 
For topical administration to internal tissue surfaces, the agent can be dispersed in a liquid tissue adhesive or other 
is substance known to enhance adsorption to a tissue surface. For example, hydroxypropylcellulose or fibrinogen/ 
thrombin solutions can be used to advantage. Alternatively, tissue-coating solutions, such as pectin-containing formu- 
lations can be used. 

[0360] For inhalation treatments, inhalation of powder {self-propelling or spray formulations) dispensed with a spray 
can. a nebulizer, or an atomizer can be used. Such formulations can be in the form of a fine powder for pulmonary 

20 administration from a powder inhalation device or self-propelling powder-dispensing formulations. In the case of self- 
propelling solution and spray formulations, the effect may be achieved either by choice of a valve having the desired 
spray characteristics (i.e.. being capable of producing a spray having the desired particle size) or by incorporating the 
active ingredient as a suspended powder in controlled particle size. For administration by inhalation, the compounds 
also can be delivered in the form of an aerosol spray from pressured container or dispenser which contains a suitable 

25 propellant. e.g., a gas such as carbon dioxide, or a nebulizer. 

[0361 ] Systemic administration also can be by transmucosal or transdermal means. For transmucosal or transdermal 
administration, penetrants appropriate to the barrier to be permeated are used in the formulation. Such penetrants 
generally are known in the an. and include, for example, for transmucosal administration, detergents, bile salts, and 
filsidic acid derivatives. Transmucosal administration can be accomplished through the use of nasal sprays or suppos- 

30 itories. For transdermal administration, the active compounds typically are formulated into ointments, salves, gels, or 
creams as generally known in the art. 

[0362] The active compounds may be prepared with carriers that will protect the compound against rapid elimination 
from the body, such as a controlled release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid. 

35 collagen, polyorthoesters. and polylactic acid. Methods for preparation of such formulations will be apparent to those 
skilled in the art. The materials also can be obtained commercially from Alza Corporation and Nova Pharmaceuticals. 
Inc. Liposomal suspensions can also be used as pharmaceutical^ acceptable carriers. These can be prepared ac- 
cording to methods known to those skilled in the art. for example, as described in U.S. Pat. No. 4.522.811 . Microsomes 
and microparticles also can be used. 

40 [0363] Oral or parenteral compositions can be formulated in dosage unit form for ease of administration and uniformity 
of dosage Dosage unit form refers to physically discrete units suited as unitary dosages for the subject to be treated: 
each unit containing a predetermined quantity of active compound calculated to produce the desired therapeutic effect 
in association with the required pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique characteristics of the active compound and the particular therapeutic 
effect to be achieved, and the limitations inherent in the an of compounding such an active compound for the treatment 
of individuals. 

[0364] As noted above, drugs identified or designed according to the invenlion can be formulated into pharmaceutical 
compositions by admixture with pharmaceutically acceptable nontoxic excipients and carriers. Such compositions can 
be prepared for parenteral administration, particularly in the form of liquid solutions or suspensions: for oral adminis- 
50 tration. particularly in the form of tablets or capsules: or tntranasally. particularly in the form of powders, nasal drops 
or aerosols. Where adhesion to a tissue surface is desired the composition can include the drug dispersed in a fibrin- 
ogen-thrombin composition or other bioadhesivc. The drug then can be painted, sprayed or otherwise applied to the 
desired tissue surface. 

Alternatively, the drugs can be formulated for parenteral or oral administration to humans or other mammals, for ex- 
55 ample, in therapeutically effective amounts, e.g.. amounts which provide appropriate concentrations of the drug to 
target tissue for a time sufficient to induce the desired effect 

[0365] Where the active compound is to be used as part of a transplant procedure, it can be provided to the living 
tissue or organ to be transplanted prior to removal of tissue or organ from the donor. The drug can be provided to the 
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donor host. Alternatively or. in addition, once removed from the donor, the organ or living tissue can be placed in a 
preservation solution containing the active compound. In all cases, the active compound can be administered directly 
to the desired tissue, as by injection to the tissue, or it can be provided systemically. either by oral or parenteral ad- 
ministration, using any of the methods and formulations described herein and/or known in the art. 
5 [0366] Where the drug comprises part of a tissue or organ preservation solution, any commercially available pres- 
ervation solution can be used to advantage. For example, useful solutions known in the art include Collins solution. 
Wisconsin solution. Belzer solution. Eurocollins solution and lactated Ringer's solution. 

[0367] The effective concentration of the compounds to be delivered in a therapeutic composition will vary depending 
upon a number of factors, including the final desired dosage of the compound to be administered and the route of 

io administration. The preferred dosage to be administered also is likely to depend on such variables as the type and 
extent of disease or indication to be treated, the overall health status of the particular patient, the relative biological 
efficacy of the compound delivered; the formulation of the drug ; the presence and types of excipients in the formulation, 
and the route of administration. In general terms., the drugs of this invention can be provided to an individual using 
typical dose units deduced from the earlier-described mammalian studies using non-human primates and rodents. 

15 [0368] When the active compounds are nucleic acid molecules, the nucleic acid may be inserted into vectors and 
used as gene therapy vectors. Gene therapy vectors can be delivered to a subject by : for example, intravenous injection, 
local administration (see U.S. Pat. No. 5.328.470) or by stereotactic injection (see e.g., Chen etal. (1994) Proc. Natl. 
Acad Set. USA 91 :3054-3057). The pharmaceutical preparation of the gene therapy vector can include the gene ther- 
apy vector in an acceplable diluent, or can comprise a slow release matrix in which the gene delivery vehicle is im- 

20 bedded. Alternatively, where the complete gene delivery vector can be produced intact from recombinant cells, e.g. 
retroviral vectors, the pharmaceutical preparation can include one or more cells which produce the gene delivery sys- 
tem. 

[0369] When an active compound of the invention is intended for administration to a plant host, the invention may 
be applied directly to the plant environment, for example, to the surface of lcavcs : buds ; roots or floral parts. Altcrna- 

25 tively. the present invention can be used as a seed coating. The determination of an effective amount of the present 
invention as required for a particular plant is within the skill of the art and will depend on such factors as the plant 
species, method of planting, and soil type. It is contemplated that compositions containing drugs of the invention can 
be prepared by formulating such drugs with adjuvants, diluents, carriers, etc., to provide compositions in the form of 
filings/divided particulate solids, granules, pellets, wetable powders, dust, aqueous suspensions or dispersions, and 

30 emulsions. It is further contemplated to use such drugs in capsulated form, for example, the drugs can be encapsulated 
within polymer, gelatin, lipids or other formulation aids such as emulsifiers. surfactants wetting agents, antifoam agents 
and anti-freeze agents, may be incorporated into such compositions especially if such compositions will be stored for 
any period of time prior to use. Application of compositions containing drugs of the invention as the active agent can 
be carried out by conventional techniques. When an active compound is intended for administration to an insect host, 

35 standard methods such as. but not limited to. aerial dispersal are contemplated. 

[0370] Active compound identified or designed by a method of the invention also include precursors of the active 
compounds. The term precursors refers to a pharmacologically inactive (or partially inactive) derivative of a parent 
molecule that requires biotransformation, either spontaneous or enzymatic, within the organism to release the active 
compounds. Precursors are variations or derivatives of the compounds of the invention which have groups cleavable 

*o under metabolic conditions. Precursors become the active compounds of the invention which are pharmaceutically 
active in vivo, when they undergo solvolysis under physiological conditions or undergo enzymatic degradation. Pre- 
cursor forms often offer advantages of solubility, tissue compatibility, or delayed release in the mammalian organism 
(see Bundgard. Design of Prodrugs , pp. 7-9. 21-24. Elsevier. Amsterdam (1985): and Silverman. The Organic Chem- 
istry of Drug Design and Drug Action, pp. 352-401 . Academic Press, San Diego. CA (1992). 

4 $ [0371] Active compound as identified or designed by the methods described herein can be administered to individuals 
to treat disorders (prophylactically or therapeutically). In conjunction with such treatment, pharmacogenomics {i.e., the 
study of the relationship between an individual's genotype and that individual's response to a foreign compound or 
drug) may be considered. Differences in metabolism of therapeutics can lead to severe toxicity or therapeutic failure 
by altering the relation between dose and blood concentration of the pharmacologically active drug. Thus, a physician 

so or clinician may consider applying knowledge obtained in relevant pharmacogenomics studies in determining whether 
to administer a drug as well as tailoring the dosage and/or therapeutic regimen of treatment with the drug. 
[0372] With regard to mammals, it is contemplated that the effective dose of a protein synthesis inducer or inhibitor 
will be in the range of about 0.01 to about 50 mg/kg. preferably about 0. 1 to about 1 0 mg/kg of body weight, administered 
in single or multiple doses. Typically, the inducer or inhibitor may be administered to a human recipient in need of 

55 treatment at a daily dose range of about 1 to about 2000 mg per patient. 

[0373] In light of the foregoing general discussion, the specific examples presented below are illustrative only and 
are not intended to limit the scope of the invention. Other generic and specific configurations will be apparent to those 
persons skilled in the art. 
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III. Examples 

A. Example 1 : Preparation of SOS Ribosomal Subunit Crystals 

5 [0374] H. marismortui (ATCC 43049) was grown as described previously (Ban et at. (1998) supra) on a slightly 
modified version of ATCC culture medium 1230. which was supplemented with 4.3 g of yeast extract. 5.1 g of Tris. and 
3.4 g of glucose per liter. Bacteria were grown at 37°C to an OD 5S0nm between 1 .0 and 2.2. They were harvested by 
centrifugation. and stored at -80°C. Cells were ruptured using a French press. Ribosomes were prepared from lysates 
by centrifugation. and subunits were isolated on sucrose gradients (Shevack et ai (1985) FEBS Lett. 184: 68-71). 

10 

1 . Reverse Extraction 
[0375] 

15 (1 ) Take 1 mg of subunits from a concentrated SOS ribosomal subunit stock (30mg/ml in 1 .2 M KCI. 0.5 M NH4CI. 

20 mM MgCI 2 , Tris 10 mM. CdCI 2 1 mM. Tns 5 mVL pH 7.5) and mix with 1/2 vol. of 30% PEG6000 (300g PEG. 
700 ml H 2 0 to make 1 liter of 30% PEG: filter through 0.2 urn filter). Leave on ice for 1 to 2 nr. 

(2) Spin down precipitate for about 30 seconds using a desktop centrifuge. 

20 

(3) Remove supernatant and add 100 I of RE-buffer: 7% PEG6000. 1 .2 M KCI, 0.5 M NH 4 CI. 100 mM KAc 30 mM 
MgCI 2 . 10 mM Tris t 10 mM MES (pH 7.5). and 1 mM CdCI 2 . 

(4) Rcsuspcnd pellet at room temperature by mixing with a P200 pipette set at 50 I. Rcsuspcndcd material should 
25 appear a little cloudy. 

(5) Wrap the eppendorf tube in aluminum foil and leave for equilibration at room temperature for 30-60 min. The 
solution will be saturated with 50S 

30 (6) Spin down for 2 minutes in desk-top centrifuge at room temperature, transfer supernatant to new eppendorf 

tube. A little pellet should be found in the tube used for centrifugation. Keep the supernatant at room temperature. 

(7) Put 8-1 0 u.l of supernatant in the sample well of a sitting drop tray (Charles-Supper). Streak seed one hour later 
from a seed stock. Seed stock is prepared by putting previously grown crystals in stabilizing solution buffer A (see 
35 below), and then vortexing them violently. To streak seed, a human hair cleaned with water and ethanol and then 

dried is passed through the vortexed solution and then touched on the new crystallization drop. Drops should took 
cloudy. The reservoirs in the sitting drop trays contain 1 000 I of a solution containing 8% PEG6000. 1 .2 M KCI. 0.5 
M NH 4 CI, 100 mM KAc. 6.5 mM HAc (yields pH 5.8). 30 mM MgCl 2 . and 1 mM CdCI 2 . 

io (B) Check after one day if seeding is succeeded. If yes. let crystals grow for three weeks. 

2. Stabilization Protocol 

[0376] When crystals have finished growing (after approximately 3 weeks), each sitting drop chamber is opened by 
is making just a single cut (slit) going from the middle and to the edge of the well. Through this narrow slit. 1 0 uJ of buffer 
A (1.2 M KCI. 0.5 M NH 4 CI. 30 mM MgCI 2 . 10%PEG6000. 1 mM CdCI 2 , 100 mM KAc. 10 mM Tris (titrated to final pH 
6.1), 30 mM MES) al room temperature is added lo each drop and 45 ul or Buffer C (0.667 M MES. 0.333 M Tris) lo 
each reservoir. 

[0377] Trays are put in a plastic box with a lid, and put in a 16" C incubator for approximately one day ; and then 
50 moved to 12° C for another day. The plastic box is then put in a polystyrene container with a lid. and put in the cold 
room for yet another day. Crystals can be kept like this for a long time, but need to undergo a further changing of buffer 
prior to any use. 

[0378] Make the following transition series using buffer A and buffer B (1 .7 M NaCL 0.5 M NH 4 CI, 30 mM MgCI 2 . 1 
mM CdCI 2 . 12% PEG6000. 20% EG. 100MM KAC (titrated to final pH 5.8 with HAC) to give final ratios of buffer B to 
55 buffer A of: 1 /1 6 : 1/8. 1/4. 1/2, 3/4. All solutions should be at cold room temperature. All manipulations of the drops will 
take place through the narrow slit. 

(1) Add 40 ul l, ly16 B to the drop, leave for 15 minutes. 
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(2) Add 40 ul "1/8" to the drop, leave for 30-60 minutes. 

(3) Take out 40 ul from the drop (and discard it in the reservoir), add 40 ul "1/4". leave for 30-60 minutes. 
5 (4) Take out 40 ul from the drop (and discard it in the reservoir), add 40 "1/2", leave for 15 minutes. 

(5) Take out 40 ul from the drop (and discard it in the reservoir), add 40 ul "3/4", leave for 15 minutes. 

(6) Take out 40 ul from the drop (and discard it in the reservoir), add40 uJ buffer B. leave for 15 minutes. 

w 

(7) Take out 60-80 ul from the drop (and discard it in the reservoir), add 60-80 uJ Buffer B. replace reservoirs with 
500 ul buffer B. 

B. Example 2: Determination of the Crystal Structure of the 50S Ribosomal Subunit, With the Initial Refinement 

15 

[0379] AH data, except the two native data sets, were collected at the National Synchrotron Light Source (Brookhaven) 
from crystals frozen at 100 K. using beamlines X12b and X25 and recorded using a 345 mm MAR imaging plate. For 
each heavy atom derivative, anomalous diffraction data were collected at the wavelength corresponding to the peak 
anomalous scattering. The beam size was 1 00 x 1 00 urn for most data collections at X25 and 200 x 200 urn al beamline 

20 Xl2b. The crystals were aligned along the long axis of the unit cell (570 A) so that 1.0° oscillations could be used to 
collect reflections out to a maximum of 2.7 A resolution at the edge of the MAR detector. At beamline X1 2b the crystal 
to detector distances varied between 450.0 mm and 550.0 mm depending on wavelength, crystal quality and beam 
divergence, and it was chosen so that maximum resolution data could be collected while avoiding overlapping of spots. 
At beamline X25 the detector was positioned on a rigid platform at 480 mm which allowed data collection to 3.2 A for 

25 iridium and osmium derivatives with the wavelength set at the anomalous edge. Native data to 2.4 A resolution were 
collected at the structural biology beamline ID19 of the Advanced Photon Source (Argonne) using a CCD detector 
(etc.). Data sets were processed by using DENZO and SCALEPACK (Otwinowski, (1 993) Data Collection and Process- 
ing). 

[0380] Heavy atom based phasing was extended to 3.2 A resolution by combining MIR phases calculated for two 
30 different isomorphous groups of data (MIR1 and MIR2. Table 1) with single derivative anomalous dispersion (SAD) 
phases. The best two derivatives were osmium pentamine and iridium hexamine. each of which contained a large 
number of binding sites (Table 1 ). Several other derivatives with smaller number of sites further improved map quality. 
All phasing was done by maximum likelihood method implemented in CNS (Brunger et at. (1998) supra) with the ex- 
ception of the Ta 6 Br 12 derivative, which was refined in SHARP (de La Fortelle. (1997) Meth. Enzymoi 276: 472-494) 
35 represented as spherically averaged electron density (Table 1). Phases were improved and extended from 3.3 A to 
2.4 A by solvent flipping (Abrahams et at. (1996) supra) and models were built. 

C. Example 3: Preparation of Crystals of 50S Ribosomal Subunit/Puromycin Complex and Collection of X-ray 
Diffraction Data 

40 

[0381] Crystals of SOS ribosomal subunits were grown and stabilized as described earlier. CCdA-p-puromycin (see 
Figure 9A) was a generous gift from Michael Yarus (Welch, ef at. (1 995) supra). Oligonucleotides from amino-N-acylated 
mmihelices (see Figure 9B) were synthesized by Dharmacon. Following deprotection . the oligonucleotides were heated 
briefly to 1 00° C and snap-cooled on ice to reanneal. Ribosomal 50S subunit crystals were stabilized and then soaked 

is for 24 hours in stabilization buffer plus 1 00 uM CCdA-p-puromycin or amino-N-acylated mini-helices prior to cryovitri- 
fication in liquid propane and X-ray diffraction data collection. Phases were calculated by density modification (CNS) 
beginning with the best experimental phases using 2F 0 (analogue)- F D (nalive) for amplitudes, from 60.0 to 3.2 A. (Native 
amplitudes were from the most isomorphous native 1 data set. except for those amplitudes which were present only 
in the more complete native 2 data set. Calculated 2F o -F 0 amplitudes which were less than twice the corresponding 

so calculated & were replaced by F 0 (analogue)). Maps were then calculated using phases from density modified and 2F 0 
(analogue)- F 0 (native) or F 0 (analogue)- F Q (native) amplitudes. 

D. Example 4: Antibiotic Binding Sites Located in the Polypeptide Exit Tunnel Near the Peptidyl Transferase 
Center 

55 

[0382] Crystalline complexes of the H. marismortui large subunit complexed with three antibiotics have been estab- 
lished at about 3.0 A resolution. The electron density maps at this resolution have allowed us to position approximately 
on the ribosome the antibiotics tylosin. carbomycin and anisomycin. We observed that these antibiotics all bind to the 
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ribosome in the region that lies between the peptidyl transferase center as defined by the Yarus inhibitor CCdA-p- 
puromycin. and the tips of the proteins L22 and L4 at the point that they form a small orifice in the polypeptide exit 
tunnel. The general location of this major antibiotic binding site is shown in Figure 1 9 Tylosin and carbomycin appear 
to function by blocking the exit of newly synthesized polypeptides. Anisomycin blocks the A site. 
5 [0383] It is contemplated that the antibiotic erythromycin will bind in almost the same location as tylosin because of 
the similarity of the two molecules and because erythromycin resistance mutations are known in both the tip of protein 
L4 and in portions of the RNA near the tylosin binding site. 

[0384] The vast majority of the interactions between those antibiotics and the ribosome are through rRNA that forms 
the A site, and the surtace of the tunnel between the peptidyl transferase center and protein L22. Since these antibiotics 

w do not bind identically, there will be many additional ways that small molecule compounds can be designed to bind in 
this region using the tools and methodologies of the present invention. For example, by connecting together compo- 
nents of each of the different antibiotics which bind to non-overlapping sites it will be possible to create new antibiotics 
(see. Example 6). In addition, based on new principles of small molecule RNA interaction shown by these antibiotic 
complexes we will be able to design entirely novel small molecules that will bind to sites on the ribosome as well as 

J 5 other potential RNA targets. 

E. Example 5: Design and Testing of Hybrid Antibiotics 

[0385] Many antibiotics that target ribosomes. more particularly large ribosomal subunils, and disrupt protein syn- 

20 thesis are complex molecules that are effectively concatenations of simpler substructures, at least one of which interacts 
with a discrete part of the ribosome. When the compound in question includes several interactive substructures, its 
binding site is effectively the sum of the subsites that contact and engage each such substructure. It has been found 
that many antibiotics that target the large ribosomal subunit bind the ribosomal subunit at sites that are close to one 
another. Thus the possibility exists of synthesizing new antibiotics in which one ribosomc-binding moiety of a first 

25 known antibiotic is linked chemically to a ribosome-binding moiety of a second known antibiotic that interacts with an 
adjacent subsite. The new compound that results is thus a chimera of the two antibiotics from which it derives. 
[0386] Chimeric antibiotics can be designed using the information about the structures of antibiotic/ribosome com- 
plexes discussed hereinabove. These structures permit the identification of antibiotic binding subsites in the ribosome, 
and the specification of the chemical entities that interact with them. Equipped with such knowledge, those skilled in 

30 the art of organic synthesis can synthesize compounds that link the substructures of interest together in ways that 
should enable them to interact with their respective subsites at the same time. Any compound devised this way that 
functions in the manner intended is likely to inhibit cell growth and if it does, protein synthesis in vivo. At the very least, 
it should block protein synthesis in in vitro assay systems. Further information about the ribosomal interactions of such 
a compound can be obtained by determining the structure of the complex it forms with the ribosome using the methods 

35 described in Section D. hereinabove, 

[0387] For example, as a result of the work described herein, it has been discovered that the disaccharide moiety 
of carbomycin binds the large ribosomal subunit at a site in close proximity to the binding site for a portion of the 
anisomycin. Using this information and the software packages described hereinabove, the skilled artisan can design 
a hybrid antibiotic comprising the relevant ribosome binding portions of carbomycin and anisomycin linked by a suitable 

-to chemical linker. This hybrid molecule, once designed, can be synthesized and purified using conventional synthetic 
organic chemistries and conventional purification schemes. Once synthesized and purified, the hybrid molecule can 
be screened for bioactivity. These screens can include, for example, growing micro-organisms on or in media either 
supplemented or lacking the hybrid molecule. Any reduction in the number of micro-organisms or the size of colonies 
in the presence of the hybrid molecule would be indicative of bioactivity. Furthermore, the hybrid molecule could be 

■*5 tested in a cell free translation system in the presence of one or more labeled amino acids. Any reduction in the level 
of labeled amino acids incorporated into proteins in cell free systems that include the hybrid molecule relative to cell 
free systems locking the hybrid molecule would be indicative that the hybrid molecule acts as a functional protein 
synthesis inhibitor. It is contemplated that the hybrid molecule could then be iteratively refined as discussed hereinabove 
to enhance its bioactive peptides and bioavailability. 

so 

INCORPORATION BY REFERENCE 

[0388] The disclosure of each of the patent documents, scientific articles, atomic-co-ordinates (including, without 
limitation, those sets deposited at the Research Collaboratory for Structural Bioinformatics Protein Data Bank (PDB) 
55 with the accession numbers PDB ID: 1FFK; PDB ID: 1FFZ: PDB ID: 1FG0; and PDB ID: 1JJ2, and/or contained on 
Disk No 1.2, or 3) referred to herein is incorporated by reference herein. 



BNSDOClD <EP U86769A2 I > 



75 



EP1 188 769 A2 



EQUIVALENTS 

[0389] The invention may be embodied in other specific forms without departing form the spirit or essential charac- 
teristics thereof. The foregoing embodiments are therefore to be considered in alt respects illustrative rather than limiting 
5 on the invention described herein. Scope of the invention is thus indicated by the appended claims rather than by the 
foregoing description, and all changes that come within the meaning and range of equivalency of the claims are intended 
to be embraced therein. 

[0390] Copies of the compact disks hereinbefore referenced as compact disks 1 of 3. 2 of 3 and 3 of 3 are filed 
simultaneously with this application for inclusion on the public file of this EPO application. The listings of atomic co- 
10 orodinates of the files PDF1FFK.DOC : PDB1 FFZ.DOC and PDB1 FG0.DOC of compact disk 1 of 3 of the file 1JJ2.RTF 
of compact disk 2 of 3 and of the files antsomycin.pdb. blasticidin. pdb, carbomycin.pdb. sparsomycin.pdb. spiramycin, 
pdb. tylosin.pdb and virginiamycin.pdb of compact disk 3 of 3 are also included in the written description of this appli- 
cation. 

[0391] As noted earlier in the description, Figures 1 to 29 include figures in colour. Black and white versions of these 
15 drawings are also included in this application as Figures 1 ' to 29'. A compact disk marked "Figures" of all these Figures, 
colour and black and white, is filed simultaneously with this application for inclusion on the public file of this application. 
[0392] This description now continues with the listings of the atomic co-ordinates from compact disks 1 of 3. 2 of 3 
and 3 of 3. as mentioned above and a sequence listing. 
[0393] The next page of this descriplion is page 1 o( listing PDB1 FFK.DOC. 

20 

Claims 

1. A crystal of a ribosomc or a ribosomal subunit wherein the crystal has an average thickness greater than about 
25 15 jim. 

2. The crystal of claim 1 wherein the average thickness is selected from the group consisting of from about 16 urn 
to about 65 \\m. from about 66 pm to about 105 \xrr\. from about 104 |.im to about 155 \\m, and from about 156 ^m 
to about 205 jim 

30 

3. The crystal of claim 1 wherein the average thickness is from about 100 urn to about 200 urn. 

4. An untwinned crystal of a ribosome or ribosomal subunit. 

35 5. The crystal of claim 1 or 4 wherein the ribosomal subunit is a large ribosomal subunit. 

6. The crystal of claim 1 or 4 wherein the ribosomal subunit is a small ribosomal subunit. 

7. The crystal of claim 1 or 4 wherein the ribosomal subunit is a SOS ribosomal subunit. 

40 

8. The crystal of claim 1 or 4 wherein the ribosome or ribosomal subunit is obtained from a prokaryote or from an 
eukaryote. 

9. The crystal of claim 1 or 4 wherein the ribosome or ribosomal subunit is obtained from an archaebacteria. 

45 

10. The crystal of claim 1 or 4 wherein the ribosome or ribosomal subunit is obtained from Hatoarcula marismortui. 

11. The crystal of claim 1 or 4 wherein the ribosomal subunit is a 60S ribosomal subunit. 

so 12. The crystal of claim 1 or 4 wherein the ribosome or ribosomal subunit is obtained from a mammal. 

13. The crystal of claim 1 or 4 wherein the crystal effectively diffracts X-rays for determination of atomic co-ordinates 
to a resolution of at least about 3.0 A. 

55 14. The crystal of claim 1 or 4 further comprising a ligand. 

15. The crystal of claim 14 wherein the ligand is bound to the ribosome or the ribosomal subunit 
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16. The crystal of claim 15 wherein the ligand is an antibiotic. 

17. The crystal of claim 16 wherein the antibiotic is a macrolide antibiotic. 

5 18. A crystal of a ribosome or a ribosomal subunit wherein the crystal effectively diffracts X-rays for determination of 
atomic co-ordinates to a resolution of at least about 3.0 A. 

19. A crystal of a ribosome or a ribosomal subunit wherein the crystal effectively diffracts X-rays for determination of 
atomic co-ordinates to a resolution of about 2.4 A. 

10 

20. A crystal of a ribosome or a ribosomal subunit wherein the crystal is sufficient to determine the atomic co-ordinates 
of the ribosome or ribosomal subunit. 

21. A crystal of a 50S ribosomal subunit comprising an atomic structure characterized by the atomic co-ordinates 
is deposited at the Protein Data Bank under accession number PDB ID: 1FFK or 1JJ2. 

22. Phases computed from the co-ordinates of claim 21 . 

23. A method of obtaining an electron density map of a selected ribosomal subunit. wherein the selected ribosomal 
20 subunit is different from the ribosomal subunit used to obtain the computed phases of claim 22 said method 

comprising: 

(a) producing a crystal of a selected ribosomal subunit. wherein the crystal is isomorphous; 

25 (b) obtaining diffraction amplitudes of the crystal produced in step (a); 

(c) combining the computed phases of claim 22 with the diffraction amplitudes obtained in step (b) to produce 
a combined data set: and 

30 (d) obtaining an electron density map of the selected ribosomal subunit based on the combined data set ob- 

tained in step (c). 

24. A method of obtaining an electron density map of a selected ribosomal subunit. wherein the selected ribosomal 
subunit is closely related to the ribosomal subunit used to obtain the computed phases of claim 22. said method 

35 comprising: 

(a) producing a crystal of a selected ribosomal subunit. wherein the crystal crystallizes in a different unit cell 
with different symmetry than the crystal which was used to compute the phases of claim 22: 

40 (b) obtaining X-ray diffraction data for the crystal produced in step (a): 

(c) obtaining phases of the selected-ribosomal subunit by using the data obtained in step (b) and the computed 
phases of claim 22 in a molecular replacement technique; and 

45 (d) obtaining an electron density map of the selected ribosomal subunit from the phases obtained in step (c). 

25. A method of obtaining a model of a selected ribosomal subunit. wherein the selected ribosomal subunit diverges 
from but is still homologous to the ribosomal subunit used to obtain the computed phases of claim 22. said method 
comprising: 

so 

(a) producing a crystal of a selected ribosomal subunit; 

(b) obtaining atomic co-ordinates for the crystal produced in step (a): 

55 (c) obtaining a model for the selected ribosomal subunit by homology modeling using the atomic co-ordinates 

obtained in step (b) and the computed phases of claim 22. 

26. A method of growing a crystal of a ribosome or a ribosomal subunit comprising: 
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(a) isolating a ribosome or a ribosomal subunit; 

(b) precipitating the ribosome or ribosomal subunit: 

5 (c) back-extracting the precipitated ribosome or ribosomal subunit to obtain a solution: 

(d) seeding the back-extracted solution: 

(e) growing a crystal of the ribosome or ribosomal subunit from the seeded solution by vapor diffusion at room 
w temperature; and 

(f) harvesting the crystal. 

27. The method of claim 26 further comprising: 

15 

(g) stabilizing the crystal by gradual transfer into a solution containing a high salt concentration; and 

(h) maintaining the crystal under high salt concentration. 

20 28. The method of claim 27 wherein the high salt concentration is from about 1 .2 M salt to about 1 .7 M salt. 

29. The method of claim 27 further comprising: 

(i) flash freezing the crystal. 

25 

30. A crystal produced by the method of claim 26 : 27. 28 or 29. 

31. A method of obtaining X-ray diffraction data for a crystal of a ribosome or a ribosomal subunit comprising: 

30 (a) obtaining a crystal of a ribosome or a ribosomal subunit. wherein the crystal has one or more of the following 

characteristics: 

(1) an average thickness of greater than 15 um: 

35 (2) untwinned; and 

(b) using X-ray crystallography to obtain X-ray diffraction data for the crystal of the ribosome or ribosomal 
subunit. 

40 32. A method of obtaining an electron density map of a ribosome or a ribosomal subunit comprising using the X-ray 
diffraction data obtained by the method of claim 31 to obtain an electron density map of the ribosome or ribosomal 
subunit. 

33. A method of obtaining X-ray diffraction data for a complex of a ribosome and a ligand or a complex of a ribosomal 
•*5 subunit and a ligand comprising: 

(a) obtaining a crystal of a ribosome or a ribosomal subunit, wherein the crystal has one or more of the following 
characteristics: 

so (1) an average thickness of greater than 15 \irr\\ 

(2) untwinned: 

(b) diffusing a ligand through the crystal so that the ligand binds the ribosome or ribosomal subunit to form a 
55 complex: and 

(c) using X-ray crystallography to obtain X-ray diffraction data for the complex. 
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34. A method of obtaining X-ray diffraction data for a complex of a ribosome and a ligand or for a ribosomal subunit 
and a ligand comprising: 

(a) obtaining a co-crystal for a complex of a ribosome and a ligand or for a complex of a ribosomal subunit 
5 and a ligand. wherein the co-crystal has one or more of the following characteristics: 

(1) an average thickness of greater than 15 urn; 

(2) untwinned; and 

10 

(b) using X-ray crystallography to obtain X-ray diffraction data for the complex. 

35. A method of obtaining an electron density map for a complex of a ribosome and a ligand or for a complex of a 
ribosomal subunit and a ligand comprising using the X-ray diffraction data obtained by the method of claim 33 or 

T5 34 to obtain an electron density map of the complex of the ribosome and the ligand or for the complex of the 

ribosomal subunit and the ligand. 

36. The method of claim 33 or 34 wherein the ligand is an antibiotic. 

20 37. A method of locating the attachment of a ligand to a ribosome or the attachment of a ligand to a ribosomal subunit 
comprising: 

(a) obtaining X-ray diffraction data for a ribosome or for a ribosomal subunit according to claim 31; 

25 (b) obtaining X-ray diffraction data for a complex of a ribosome and a ligand or for a complex of a ribosomal 

subunit and a ligand according to the method of claim 33 or 34; 

(c) subtracting the X-ray diffraction data obtained in step (a) from the X-ray diffraction data obtained in step 

(b) to obtain the difference in the X-ray diffraction data; 

30 

(d) obtaining phases that correspond to X-ray diffraction data obtained in step (a) using one or more of the 
techniques selected from the group consisting of MIR. MIRAS and SAD: 

(e) utilizing the phases obtained in step (d) and the difference in the X-ray diffraction data obtained in step (c) 
35 to compute a difference Fourier image of the ligand: and 

(f) locating the attachment of the ligand to a ribosome or the attachment of the ligand to a ribosomal subunit 
based on the computations obtained in step (e). 

JO 38. A method of obtaining a map of a ligand attached to a ribosome or of a ligand attached to a ribosomal subunit 
comprising: 

(a) obtaining X-ray diffraction data for a ribosome or for a ribosomal subunit according to claim 31; 

45 (b) obtaining X-ray diffraction data for a complex of a ribosome and a ligand or a complex of a ribosomal 

subunit and a ligand according to the method of claim 33 or 34; 

(c) obtaining phases that correspond to X-ray diffraction data obtained in step (a) using one or more of the 
techniques selected from the group consisting of MIR. MIRAS and SAD; and 

50 

(d) utilizing the phases obtained in step (c) and the X-ray diffraction data obtained in step (b) to compute a 
map of the ligand and the ribosome or of the ligand and the ribosomal subunit. 

39. The method of claim 37, wherein the ligand is an antibiotic. 

55 

40. A method of obtaining a modified agent comprising: 

(a) obtaining a crystal of a ribosome or of a ribosomal subunit with or without a bound agent; 
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(b) obtaining the atomic co-ordinates of at least a portion of the ribosome or ribosomal subunit with or without 
the bound agent: 

(c) using the atomic co-ordinates and one or more molecular modeling techniques to determine how to modify 
5 the interaction of the agent with a ribosome or ribosomal subunit: and 

(d) modifying the agent based on the determinations obtained in step (c) to produce a modified agent. 

41. The method of claim 40 r wherein the one or more molecular modeling techniques are selected from the group 
io consisting of graphic molecular modeling and computational chemistry. 

42. The method of claim 40 further comprising contacting the modified agent with a ribosome or ribosomal subunit 
and detecting the interaction of the modified agent to the ribosome or ribosomal subunit. 

is 43. A modified agent produced by the method of claim 40 wherein the modified agent binds differently to a ribosome 
or ribosomal subunit than does the agent from which the modified agent was derived. 

44. The modified agent of claim 43 ; wherein the modified agent is a therapeutic agent. 

20 45. The method of claim 40. wherein the atomic co-ordinates of the ribosome or ribosomal subunit crystal are deposited 
at the Protein Data Bank under accession number PDB ID: 1 FFK ; 1 FF2. 1 FGO or 1JJ2. 

46. A modified agent produced by the method of claim 40. wherein the modified agent binds differently to a ribosome 
or ribosomal subunit than docs the agent from which the modified agent was derived. 

25 

47. A computer system comprising: 

(a) a memory having stored therein data indicative of atomic co-ordinates derived from an electron density 
map having a resolution of at least about 4.5 A and defining a ribofunctional locus of a large subunit of a 

30 ribosome; and 

(b) a processor in electrical communication with the memory, the processor comprising a program for gener- 
ating a three-dimensional model representative of the ribofunctional locus. 

35 48. The computer system of claim 47, further comprising a device for providing a visual representation of the model. 

49. The computer system of claim 47. wherein the atomic co-ordinates comprise at least a portion of the atomic co- 
ordinates deposited at the Protein Data Bank under accession number PDB ID: 1 FFK, 1 FFZ. 1 FGO. or IJJ2. 

io so. The computer system of claim 47, wherein the atomic co-ordinates further define at least a portion of a protein 
synthesis inhibitor complexed with a ribofunctional locus. 

51. The computer system of claim 50. wherein the protein synthesis inhibitor is an antibiotic. 

^5 52. The computer system of claim 51 . wherein the atomic co-ordinates comprise at least a portion of the atomic co- 
ordinates recorded on Disk No. 3 of 3 under file number anisomycin.pdb. blasticidin.pdb. carbomycin.pdb. spar- 
somycin.pdb ; spiramycin. pdb. lylosin.pdb. or virginiamycin.pdb. 

53. The computer system of claim 47. wherein the ribofunctional locus comprises at least a portion of an active site 
so in the ribosomal subunit. 

54. The computer system of claim 53. wherein the active site comprises at least a portion of a peptidyl transferase site. 

55. The computer system of claim 54. wherein the peptidyl transferase site is defined by a plurality of residues set 
55 forth in Table 5. 

56. The computer system of claim 47, wherein the ribofunctional locus comprises at least a portion of an A-site. 
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57. The computer system of claim 56. wherein the A-site is defined by a plurality of residues set forth in Table 6 

58. The computer system of claim 47 or 56. wherein the ribofunctional locus comprises at least a portion of a P-site. 

5 59. The computer system of claim 58. wherein the P-site is defined by a plurality of residues set forth in Table 7. 

60. The computer system of claim 47 or 56, wherein the ribofunctional locus comprises at least a portion of a polypep- 
tide exit tunnel. 

10 61. The computer system of claim 60 : wherein the exit tunnel is defined by a plurality of residues set forth in Table 8. 
Table 9 or Table 10. 

62. The computer system of claim 58, wherein the ribofunctional locus comprises at least a portion of a polypeptide 
exit tunnel. 
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63. The computer system of claim 62 ; where the exit tunnel is defined by a plurality of residues set forth in Table 8. 
Table 9 or Table 10. 

64. The computer system of claim 47. wherein the ribofunctional locus is defined by a plurality of residues sel forth in 
20 Table 11. Table 12, Table 13. Table 14, Table 15 } Table 16 or Table 17. 

65. The computer system of claim 47. wherein the atomic co-ordinates are produced by molecular modeling. 

66. The computer system of claim 47 or 65 : wherein the atomic co-ordinates arc produced by homology modeling 
25 using at least a portion of the atomic co-ordinates deposited at the Protein Data Bank under accession number 

PDB ID: 1FFK. 1 FF2 ; 1FG0 ; or 1JJ2. 

67. The computer system of claim 47 or 65, wherein the atomic co-ordinates are produced by molecular replacement 
using at least a portion of the atomic co-ordinates deposited at the Protein Data Bank under accession number 

30 PDB ID: 1FFK. 1 FFZ, 1 FGO, or 1JJ2. 

68. The computer system of claim 47. wherein the ribofunctional locus is defined by atoms of a ribosomal RNA. 

69. The computer system of claim 47 or 68, wherein the ribofunctional locus is defined by atoms of a ribosomal protein. 

35 

70. The computer system of claim 47. wherein the atomic co-ordinates define a residue that is present in a ribosome 
of a pathogen but absent from a ribosome of a host organism. 

71. The computer system of claim 70 ; wherein the host organism is a mammal. 

40 

72. The computer system of claim 71 ; wherein the mammal is a human. 

73. The computer system of claim 47. wherein the atomic co-ordinates define residues that are conserved among 
pathogens. 

45 

74. The computer system of claim 47, further comprising a program for performing drug design. 

75. A molecular model produced by the computer system of claim 47. 

50 76. A method of identifying a candidate molecuie : the method comprising the steps of: 

(a) providing a molecular model of a ribofunctional locus of a large subunit of a ribosome, wherein the molecular 
model is based on atoms derived from an electron density map having a resolution of at least about 4.5 A; and 

55 (b) using the model to identify a candidate molecule having a surface complementary to the ribofunctional 

locus. 

77. The method of claim 76. wherein the candidate molecule binds the ribofunctional locus of the large subunit of the 
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ribosome. 

78. The method of claim 76. comprising the additional step of producing the candidate molecule identified in step (b). 

5 79. The method of claim 76 or 78. comprising the additional step of determining whether the candidate molecule 
modulates ribosomal activity. 

80. The method of claim 79. comprising the additional step of identifying a modified molecule. 

10 81. The method of claim 80. comprising the additional step of producing the modified molecule. 

82. The method of claim 81 7 comprising the additional step of determining whether the modified molecule modulates 
ribosomal activity. 

'5 83. The method of claim 82. comprising the additional step of producing the modified molecule. 

84. The method of claim 76. wherein the candidate molecule is an antibiotic or an antibiotic analogue. 

85. The method of claim 80. wherein the modified molecule is an antibiotic or an antibiotic analogue. 

20 

86. The method of claim 84. wherein the antibiotic or antibiotic analogue is a macrolide. 

87. The method of claim 76 T wherein the ribofunctional locus comprises at ieast a portion of an active site. 

25 88. The method of claim 87. wherein the active site comprises at least a portion of a peptidyl transferase site. 

89. The method of claim 87 ; wherein the peptidyl transferase site is defined by a plurality of residues set forth in Table 5. 

90. The method of claim 76. wherein the ribofunctional locus comprises at least a portion of an A-site. 

30 

91. The method of claim 90. wherein the A-site is defined by a plurality of residues set forth in Table 6. 

92. The method of claim 76 or 90. wherein the ribofunctional locus comprises a least a portion of a P-site. 

35 93, The method of claim 92 ; wherein the P-site is defined by a plurality of residues set forth in Table 7. 

94. The method of claim 76 or 90 : wherein the ribofunctional locus comprises at least a portion of a polypeptide exit 
tunnel. „ 

•*o 95. The method of claim 94 ; wherein the exit tunnel is defined by a plurality of residues set forth in Table 8. Table 9 or 
Table 10. 

96. The method of claim 92, wherein the ribofunctional locus comprises at least a portion of a polypeptide exit tunnel. 

45 97. The method of claim 96 : wherein the exit tunnel is defined by a plurality of residues set forth in Table 8. Table 9 or 
Table 10. 

98. The method of claim 76. wherein the ribofunctional locus is defined by a plurality of residues set forth in Table 11 . 
Table 12, Table 13. Table 14, Table 15. Table 16 or Table 17. 
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99. The method of claim 76. wherein the molecular model is in an electronic form. 

1 00. The method of claim 76, wherein the molecular model is generatedfrom atomic co-ordinates produced by molecular 
modeling. 

1 01 The method of claim 76 or 1 00, wherein the molecular model is generated from atomic co-ordinates produced by 
homology modeling using at least a portion of the atomic co-ordinates deposited at the Protein Data Bank under 
accession number PDB ID: 1FFK, 1FFZ. 1FG0. or 1JJ2. 
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102. The method of claim 76 or 100. wherein the molecular model is generated from atomic co-ordinates produced by 
molecular replacement using at least a portion of the atomic co-ordinates deposited at the Protein Data Bank under 
accession number PDB ID: 1FFK. 1FFZ. 1FG0. or 1JJ2. 

103. The method of claim 76. wherein the molecular model comprises residues that are conserved among prokaryotic 
organisms. 

1 04. The method of claim 76 ; wherein the molecular model comprises a residue that is present in a prokaryotic ribosome 
but is absent from a eukaryotic ribosome. 

105. The method of claim 104. wherein the eukaryotic ribosome is a mammalian ribosome. 

106. An protein synthesis inhibitor comprising: 

a first binding domain having a surface that mimics or duplicates a surface of a known first molecule that binds 
with a first contact site in a large ribosomal subunit; and 

a second binding domain having a surface that mimics or duplicates a surface of a known second molecule 
that binds with a second contact site in the ribosomal subuniL 

wherein the first domain is attached to the second domain so as to permit both the first domain and the 
second domain to bind with its respective contact site thereby to disrupt protein synthesis in a ribosomal subunit. 

107Thc inhibitor of claim 106, wherein the first molecule is a first antibiotic. 

25 

108. The inhibitor of claim 106, wherein the first antibiotic binds at least a portion of a ribofunctional locus. 

109. The inhibitor of claim 106 or 107. wherein the second molecule is a second antibiotic. 

30 HO.The inhibitor of claim 109. wherein the second antibiotic binds at least a portion of a ribofunctional locus. 
111. An engineered, synthetic protein synthesis inhibitor, the inhibitor comprising: 

a binding domain having a surface that mimics or duplicates a surface of a known molecule which binds with 
35 a contact site in a ribosomal subunit: and 

an effector domain attached to the binding domain which, upon binding of the binding domain with the contact 
site, occupies a space within or adjacent the ribosomal subunit thereby to disrupt protein synthesis in the 
ribosomal subunit. 
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112.The inhibitor of claim 111 ; wherein the surface of the binding domain mimics or duplicates a surface of a known 
antibiotic which binds with the contact site. 
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Figure 3B 
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Figure 4 
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Figure 16 
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